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Abstract 

To  what  degree  should  societies  allow  inequality  to  be  inherited?  What  role  should  estate 
taxation  play  in  shaping  the  intergenerational  transmission  of  welfare?  We  explore  these  ques- 
tions by  modeling  altruistically-linked  individuals  who  experience  privately  observed  taste  or 
productivity  shocks.  Our  positive  economy  is  identical  to  models  with  infinite- lived  individ- 
uals where  efficiency  requires  immiseration:  inequality  grows  without  bound  and  everyone's 
consumption  converges  to  zero.  However,  under  an  intergenerational  interpretation,  previous 
work  only  characterizes  a  particular  set  of  Pareto-efficient  allocations:  those  that  value  only 
the  initial  generation's  welfare.  We  study  other  efficient  allocations  where  the  social  welfare 
criterion  values  future  generations  directly,  placing  a  positive  weight  on  their  welfare  so  that 
the  effective  social  discount  rate  is  lower  than  the  private  one.  For  any  such  difference  in  social 
and  private  discounting  we  find  that  consumption  exhibits  mean-reversion  and  that  a  steady- 
state,  cross-sectional  distribution  for  consumption  and  welfare  exists,  where  no  one  is  trapped 
at  misery.  The  optimal  allocation  can  then  be  implemented  by  a  combination  of  income  and 
estate  taxation.  We  find  that  the  optimal  estate  tax  is  progressive:  fortunate  parents  face 
higher  average  marginal  tax  rates  on  their  bequests. 

1      Introduction 

Societies  inevitably  choose  the  inheritability  of  inequality.  Some  balance  between  equality  of  oppor- 
tunity for  newborns  and  incentives  for  altruistic  parents  is  struck.  We  explore  how  this  balancing 
act  plays  out  to  determine  long-run  inequality  and  draw  some  novel  implications  for  optimal  estate 
taxation. 


*For  useful  discussions  and  comments  we  thank  Daron  Acemoglu,  Fernando  Alvarez,  George-Marios  Angeletos, 
Abhijit  Banerjee,  Gary  Becker,  Olivier  Blanchard,  Ricardo  Caballero,  Dean  Corbae,  Mikhail  Golosov,  Bengt  Holrn- 
strom,  Narayana  Kocherlakota,  Robert  Lucas,  Casey  Mulligan,  Roger  Myerson,  Chris  Phelan,  Gilles  Saint-Paul, 
Nancy  Stokey.  Jean  Tirole  and  seminar  and  conference  participants  at  Chicago,  Minnesota,  MIT  and  the  Texas 
Monetary  Conference  held  at  the  University  of  Austin  in  honor  of  the  late  Scott  Freeman.  This  work  begun  moti- 
vated by  a  seminar  presentation  of  Chris  Phelan  at  MIT  in  May  2004.  We  also  have  gained  significant  insight  from 
a  manuscript  by  Scott  Freeman  and  Michael  Sadler — we  thank  Dean  Corbae  for  bringing  it  to  our  attention. 


Existing  normative  models  of  inequality  reach  an  extreme  conclusion:  inequality  should  be  per- 
fectly inheritable  and  rise  steadily  without  bound,  with  everyone  converging  to  absolute  misery  and 
a  vanishing  lucky  fraction  to  bliss.  This  immiseration  result  is  robust;  requires  very  weak  assump- 
tions on  preferences  (Phelan,  1998);  and  obtains  invariably  in  partial  equilibrium  (Green,  1987, 
Thomas  and  Worrall,  1990),  in  general  equilibrium  (Atkeson  and  Lucas,  1992),  and  across  environ- 
ments with  moral-hazard  regarding  work  effort  or  with  private  information  regarding  preferences 
or  productivity  (Aiyagari  and  Alvarez,  1995). 1 

We  depart  minimally  from  these  contributions,  adopting  the  same  positive  economic  models, 
but  a  slightly  different  normative  criterion.  In  a  generational  context,  previous  work  with  infinite- 
lived  agents  characterizes  the  instance  where  future  generations  are  not  considered  directly,  but 
only  indirectly  through  the  altruism  of  earlier  ones.  On  the  opposite  side  of  the  spectrum,  Phelan 
(2005)  proposes  a  social  planner  with  equal  weights  on  all  future  generations.  Our  interest  here  is  in 
exploring  a  class  of  Pareto-efficient  allocations  that  take  into  account  the  current  population  along 
with  unborn  future  generations.  We  place  a  positive  and  vanishing  Pareto  weight  on  the  expected 
utility  of  future  generations,  this  leads  effectively  to  a  social  discount  rate  that  is  lower  than  the 
private  one. 

This  relatively  small  change  produces  a  drastically  different  result:  long-run  inequality  remains 
bounded,  a  steady-state,  cross-sectional  distribution  exists  for  consumption  and  welfare,  social  mo- 
bility is  possible  and  everyone  avoids  misery.  Indeed,  welfare  typically  remains  above  an  endogenous 
lower  bound  that  is  strictly  better  than  misery.  This  outcome  holds  however  small  the  difference 
between  social  and  private  discounting,  and  regardless  of  whether  the  source  of  asymmetric  infor- 
mation is  privately  observed  preferences  or  productivity  shocks. 

We  begin  by  modeling  a  positive  economy  that  is  identical  to  the  taste-shock  setup  developed 
by  Atkeson  and  Lucas  (1992).  Each  generation  is  composed  of  a  continuum  of  individuals  who 
live  for  one  period  and  are  altruistic  towards  a  single  descendant.  There  is  a  constant  aggregate 
endowment  of  the  only  consumption  good  in  each  period.  Individuals  are  ex-ante  identical,  but 
experience  idiosyncratic  shocks  to  preferences  that  are  only  privately  observed — thus  ruling  out  first- 
best  allocations.  Feasible  allocations  must  be  incentive  compatible  and  must  satisfy  the  aggregate 
resource  constraint  in  all  periods. 

When  only  the  welfare  of  the  first  generation  is  considered,  the  planning  problem  is  equivalent 
to  that  of  an  economy  with  infinite-lived  individuals.  Intuitively,  immiseration  then  results  from 
the  desire  to  smooth  the  dynastic  consumption  path:  rewards  and  punishments,  required  for  incen- 
tives, are  best  delivered  permanently.  As  a  result,  the  consumption  process  inherits  a  random-walk 
component  that  leads  cross-sectional  inequality  to  grow  endlessly  without  bound.  Infinite  spreading 
of  the  distribution  is  consistent  with  a  constant  aggregate  endowment  only  if  everyone's  consump- 


1Many  find  the  immiseration  result  perplexing  and  some  even  find  it  morally  questionable,  but  it  is  also  incon- 
venient from  a  practical  standpoint.  Long-run  steady-states  often  provide  a  natural  benchmark  to  study  dynamic 
economies,  but  such  long-run  analyses  are  not  possible  for  private-information  economies  without  a  steady-state 
distribution  with  positive  consumption.  This  has  impaired  the  study  of  long-run  implications  of  optimal  taxation, 
so  common  in  the  Ramsey  taxation  literature. 


tion  eventually  converges  to  zero.    Note,  that  as  a  consequence,  no  steady-state,  cross-sectional 
distribution  with  positive  consumption  exists. 

Across  generations,  this  arrangement  requires  a  lock-step  link  between  the  welfare  of  parent  and 
child.  Of  course,  the  perfect  intergenerational  transmission  of  welfare  improves  parental  incentives — 
but  at  the  expense  of  exposing  newborns  to  the  risk  of  their  parent's  luck.  Individuals  would  value 
being  insured  against  the  uncertainty  of  their  family's  fortune — it  is  often  recognized  that  one  of 
the  biggest  risks  one  faces  in  life  is  regarding  the  family  one  is  born  into. 

By  contrast,  it  remains  optimal  to  link  the  fortunes  of  parents  and  children  in  our  model,  but 
no  longer  in  lock-step.  Rewards  and  punishments  are  distributed  over  all  future  descendants,  but 
in  a  front-loaded  manner.  This  creates  a  mean-reverting  tendency  in  consumption — instead  of  a 
random  walk — that  is  strong  enough  to  bound  long-run  inequality.  The  result  is  a  steady-state 
cross- sectional  distribution  for  consumption  and  welfare,  with  no  fraction  of  the  population  stuck 
at  misery. 

We  also  study  a  repeated  Mirrleesian  version  of  our  economy  and  derive  implications  for  optimal 
estate  taxation.  In  this  model,  individuals  have  identical  preferences  with  regard  to  consumption 
and  work  effort,  but  are  heterogenous  in  the  productivity  of  their  work  effort.  Information  about 
productivity  and  work  effort  is  private — only  the  resulting  output  is  publicly  observable.  We  show 
that  the  analysis  from  the  taste-shock  model  carries  over  to  this  setup,  virtually  without  change.  In 
particular,  a  very  similar  Bellman  equation  characterizes  the  solution  to  the  social  planning  prob- 
lem: consumption  exhibits  mean-reversion  and  has  a  steady-state  cross-sectional  distribution.  This 
outcome  highlights  the  fact  that  our  results  do  not  require  any  particular  asymmetry  of  information. 

More  importantly,  the  Mirrleesian  model  offers  new  insights  into  estate  taxation.  Feasible  al- 
locations can  be  implemented  by  combining  income  and  estate  taxes.  Specifically,  we  find  that  a 
progressive  estate  tax,  which  imposes  a  higher  average  marginal  tax  rate  on  the  bequests  of  fortu- 
nate parents,  is  optimal.  This  result  reflects  the  mean-reversion  of  consumption:  more  fortunate 
dynasties,  with  relatively  high  levels  of  current  consumption,  must  have  a  declining  consumption 
path  induced  by  higher  estate  tax  rates  that  lower  the  net  rates  of  return  across  generations. 

Finally,  an  important  methodological  contribution  of  this  paper  is  to  reformulate  the  social 
planning  problem  recursively.  In  doing  so,  we  extend  ideas  introduced  by  Spear  and  Srivastava 
(1987)  to  situations  where  private  and  social  preferences  differ.  Indeed,  we  are  able  to  reduce  the 
dynamic  program  to  a  one-dimensional  state  variable,  and  our  analysis  and  results  heavily  exploit 
the  resulting  Bellman  equation. 

Related  Literature.  Our  paper  is  most  closely  related  to  Phelan  (2005),  who  considered  a 
social  planning  problem  with  no  discounting  of  the  future.  He  shows  that  if  a  steady  state  for  the 
planning  problem  exists  then  it  must  solve  a  static  maximization  problem,  and  that  solutions  to  this 
problem  have  strictly  positive  inequality  and  social  mobility.  Our  paper  establishes  the  existence 
of  a  steady-state  distribution  for  the  planning  problem  for  any  difference  in  social  and  private 
discounting.  Unlike  the  case  with  no  discounting,  there  is  no  associated  static  planning  problem  for 
steady-state  distributions,  and  as  a  result,  the  methods  we  develop  here  are  very  different. 


In  overlapping-generation  models  without  altruistic  links,  all  market  equilibria  that  are  Pareto 
efficient  place  positive  direct  weight  on  future  generations.  Bernheim  (1989)  was  the  first  to  point 
out  that  in  the  dynastic  extension  of  these  models  with  altruism,  many  Pareto  efficient  allocations 
are  not  attainable  by  the  market.  Kaplow  (1995)  argued  that  these  Pareto  efficient  allocations 
are  natural  social  objectives  and  that  they  can  be  implemented  by  market  equilibria  with  estate 
taxation  policy.  The  estate  tax  is  negative — it  is  a  subsidy  —  so  as  to  internalize  the  externality  of 
giving  on  future  generations. 

Our  work  contributes  to  a  large  literature  on  dynamic  economies  with  asymmetric  information. 
In  addition  to  the  work  mentioned  above,  this  includes  recent  research  on  dynamic  optimal  taxa- 
tion (e.g.,  Golosov,  Kocherlakota  and  Tsyvinski,  2003;  Albanesi  and  Sleet,  2004;  and  Kocherlakota, 
2004).  This  application  has  been  handicapped  by  the  immiseration  result  and  by  the  non-existence 
of  a  steady-state  distribution  with  positive  consumption,  making  it  difficult  to  draw  long-run  con- 
clusions for  optimal  taxation.  Our  results  provide  an  encouraging  way  to  overcome  this  problem. 

Our  work  is  also  indirectly  related  to  Sleet  and  Yeltekin  (2004),  who  study  an  Atkeson-Lucas 
environment  with  a  utilitarian  planner,  who  lacks  commitment  and  cares  only  for  the  current 
generation.  In  this  environment,  as  in  Phelan's,  it  is  a  foregone  conclusion  that  immiseration  will 
not  obtain,  so  that  the  interesting  question  is  how  to  solve  for  the  best  subgame-perfect  equilibria. 
Sleet  and  Yeltekin  derive  first-order  conditions  from  a  Lagrangian  and  use  these  to  numerically 
simulate  the  solution.  Interestingly,  it  turns  out  that  the  best  allocation  in  their  no-commitment 
environment  is  asymptotically  equivalent  to  the  optimal  one  with  commitment  but  featuring  a  more 
patient  welfare  criterion.  Thus,  our  own  approach  and  results  provide  an  indirect,  but  effective 
way  of  characterizing  the  no-commitment  problem  and  of  formally  establishing  that  a  steady-state 
distribution  with  no  one  at  misery  exists. 

The  rest  of  the  paper  is  organized  as  follows.  Section  2  introduces  the  economic  environment 
and  sets  up  the  social  planning  problem.  In  Section  3,  we  develop  a  recursive  version  of  the  planning 
problem  and  draw  its  connection  to  our  original  formulation.  The  resulting  Bellman  equation  is 
then  put  to  use  in  Section  4  to  characterize  the  solution  to  the  social  planning  problem.  Here  we 
derive  our  main  results  on  mean-reversion  and  on  the  existence  of  a  steady-state  distribution  for 
consumption.  We  discuss  these  results  in  Section  5  and  develop  intuition  for  them  by  studying 
some  related  problems  and  reformulations.  In  Section  6,  we  turn  to  the  canonical  optimal-taxation 
setup  with  productivity  shocks  and  focus  on  its  implications  for  estate  taxes.  Section  7  offers  some 
conclusions  from  the  analysis.  All  proofs  omitted  in  the  main  text  are  contained  in  the  Appendix. 

2      A  Social  Insurance  Problem 

The  backbone  of  our  model  requires  a  tradeoff  between  insurance  and  incentives.  This  tradeoff 
can  be  due  to  private  information  regarding  either  productivity  or  preferences.  For  purposes  of 
comparison,  we  first  adopt  the  Atkeson-Lucas  taste-shock  specification.  In  Section  6,  we  adapt  our 
arguments  to  a  repeated  Mirrleesian  model  with  privately  observed  productivity  shocks.    Similar 


arguments  could  be  applied  to  moral-hazard  situations  with  unobservable  effort  choices. 

Our  positive  economy  is  identical  to  that  of  Atkeson-Lucas — the  differences  are  only  normative. 
An  infinite-lived  agent  can  be  interpreted  as  a  dynasty  of  individuals  who  have  finite  lives  but  are 
altruistically  linked.  Under  this  interpretation,  Atkeson-Lucas  and  others  focus  on  a  particular 
set  of  efficient  allocations:  those  that  only  directly  consider  the  welfare  of  the  initial  generation.2 
In  contrast,  our  interest  here  lies  with  efficient  allocations  that  directly  weigh  the  welfare  of  all 
future  generations.  This  approach  is  asymptotically  equivalent  to  postulating  preferences  for  an 
infinitely-lived  social  planner  who  is  more  patient  than  individuals. 

Demography,  Preferences  and  Technology.  At  any  point  in  time,  our  economy  is  populated 
by  a  continuum  of  individuals  who  have  identical  preferences,  live  for  one  period,  and  are  replaced 
by  a  single  descendant  in  the  next.  Parents  born  in  period  t  are  altruistic  towards  their  only  child 
and  their  utility  vt  satisfies 

vt  =  Ei_i  [9tu(ct)  +  fivt+i } , 

where  ct  >  0  is  the  parent's  own  consumption  and  P  £  (0, 1)  is  the  altruistic  weight  placed  on  the 
descendant's  utility  vt+\.  The  utility  function  u(c)  is  assumed  continuous  and  concave,  with  a  contin- 
uous derivative  for  all  c  >  0  satisfying  the  Inada  conditions  lim^o  u'(c)  =  oo  and  lim^oo  u'(c)  =  0. 
The  taste  shock  9  e  6  is  distributed  identically  and  independently  across  individuals  and  time. 

This  specification  of  altruism  is  consistent  with  individuals  having  a  preference  over  the  entire 
future  consumption  of  their  dynasty  given  by 

oo 

vt  =  YJP5^t-i[et+su{ct+s)].  (1) 

s=0 

In  each  period,  a  resource  constraint  limits  aggregate  consumption  to  be  no  greater  than  some 
constant  aggregate  endowment  e  >  0.  These  specifications  choices  preferences  and  technology  are 
precisely  those  adopted  by  Atkeson-Lucas. 

Define  U  =  u(M+)  to  be  the  set  of  all  possible  utility  values.  Note  that  we  allow  utility  to 
be  unbounded  so  that  the  extremes  u  =  u(0)  and  u  =  lim^oo  u(c)  may  be  finite  or  infinite.  The 
cost  function  c(u)  is  defined  on  U  as  the  inverse  of  the  utility  function  c  =  u~l .  For  simplicity,  we 
assume  0  contains  a  finite  number  of  shocks  9  =  6-y  <  9<i  <  ■  ■  ■  <  6^  =  6.  We  denote  the  density 
by  p(6)  and  adopt  the  normalization  that  E[0]  =  Yln=i  @nP{9n)  =  1-  The  level  of  dynastic  utility  vt 
then  always  belongs  to  the  set  V  =  w(M+)/(l  —  (3)  with  extremes  v  =  u/(\  —  P)  and  v  =  u/{\  —  /?). 

Social  Welfare.  We  depart  from  Atkeson-Lucas  by  assuming  that  the  social  welfare  criterion  can 
be  represented  by  preferences  given  by  the  utility  function 


j^E-itM*)],  (2) 


t=o 


2The  final  paragraph  in  Atkeson  Lucas  (1992)  discusses  the  possible  importance  of  relaxing  this  assumption. 


with  J3  >  P.  Thus,  social  preferences  are  identical  to  the  individual  preferences  given  by  (1),  except 
for  the  discount  factor. 

This  setup  puts  weight  on  the  welfare  of  future  generations  directly.  Future  generations  are 
already  indirectly  valued  through  the  altruism  of  the  current  generation.  If,  in  addition,  they  are 
also  directly  included  in  the  welfare  function  the  social  discount  factor  must  be  higher  than  p.  To 
see  this,  consider  the  utilitarian  welfare  criterion 

oo  oo 

^a*E_1Ut  =  ^^E_1[^«(ct)]>  (3) 

4=0  t=0 

where  St  =  Pl  +  Pt_1a  +  •  •  •  +  Pa1'1  +  a1.  Then  the  discount  factor  satisfies 


and  social  preferences  are  more  patient.    In  the  limit  8t+i/5t  — >  max{/3,  a},  so  the  welfare  crite- 
rion (3)  approaches  (2)  with  0  =  max{/?,  a}.3'4 

Atkeson-Lucas'  analysis  applies  to  the  case  with  P  =  $,  so  we  focus  on  the  case  where  enough 
weight  is  placed  on  future  generations  to  ensure  that  the  long-run  social  discount  factor  remains 
strictly  higher  than  the  private  one,  $  >  p.  Although  we  adopt  the  preference  in  (2)  directly  for  the 
rest  of  the  paper,  it  is  straightforward  to  adapt  the  arguments  to  the  welfare  criterion  (3).  The  two 
specifications  are  slightly  different  for  any  finite  horizon  but  are  identical  for  the  long-run,  which  is 
our  primary  concern. 

Information  and  Incentives.  Taste  shock  realizations  are  privately  observed  by  individuals  and 
their  descendants.  The  revelation  principle  then  allows  us  to  restrict  our  attention  to  mechanisms 
that  rely  on  truthful  reports  of  these  shocks.  Thus,  each  dynasty  faces  a  sequence  of  consumption 
functions  {q},  where  ct(0t)  represents  an  individual's  consumption  after  reporting  the  history  9t  = 
(6o,  8i, . . .  ,8t).  A  dynasty's  reporting  strategy  a  =  {at}  is  a  sequence  of  functions  at  :  0i+1  — >  0 
that  maps  histories  of  shocks  9t  into  a  current  report  0t.  Any  strategy  a  induces  a  history  of  reports 
a1  :  0£  -*  G*.  We  use  ct*  to  denote  the  truth- telling  strategy  with  a*t(9l)  =  0t  for  all  9l  e  0*. 
Given  an  allocation  {ct},  the  utility  obtained  from  any  reporting  strategy  a  is 

oo 

[/({cj.a;/?)^    J2    /?M*(*W))  PK?)- 

t  =  0    0<£0t+l 


3Bernheim  (1989)  performs  similar  intergenerational  discount  factor  calculations  in  his  welfare  analysis  of  a 
deterministic  dynastic  saving  model.  Caplin  and  Leahy  (2005)  argue  that  these  ideas  also  apply  to  intra-personal 
discounting  within  a  lifetime,  leading  to  a  social  discount  factor  that  is  greater  than  the  private  one  not  only  across 
generations,  but  within  generations  as  well. 

4Onc  can  also  adopt  the  more  general  welfare  criterion  J^^L0  at  E_iut  for  some  sequence  of  positive  Pareto  weights 
{at}.  In  particular,  the  sequence  a0  =  (1  -j3)/(l  -  (i)  and  at  =  ao/3*  for  t  >  1  delivers  St+i/5t  =  ft  for  all  t  =  0, 1, . . . 


An  allocation  {ct}  is  incentive  compatible  if  truth-telling  is  optimal: 

U({ct},a*;P)>U({ct},a;P).  (4) 

Social  Planning  Problem.  Following  Atkeson-Lucas,  we  identify  each  dynasty  with  a  number 
v,  which  we  interpret  as  its  initial  entitlement  to  expected,  discounted  utility,  vq  =  v.  We  assume 
that  all  dynasties  with  the  same  entitlement  v  receive  the  same  treatment.  We  then  let  ip  denote  a 
distribution  of  utilities  v  across  the  population  of  dynasties:  ip(A)  is  the  fraction  of  dynasties  who 
will  receive  expected  discounted  utility  in  the  set  A  C  R. 

An  allocation  is  a  sequence  of  functions  {<:£}  for  each  v,  where  0^(0')  represents  the  consumption 
that  a  dynasty  with  initial  entitlement  v  gets  at  date  t  after  reporting  the  sequence  of  shocks  #*. 
For  any  given  initial  distribution  of  entitlements  ip  and  resources  e,  we  say  that  an  allocation  {cvt} 
is  feasible  if:  (i)  it  is  incentive  compatible  for  all  dynasties;  (ii)  it  delivers  expected  utility  of  at  least 
v  to  all  initial  dynasties  entitled  to  v;  and  (iii)  average  consumption  in  the  population  does  not 
exceed  the  fixed  endowment  e  in  all  periods.  We  let  e*(ip)  denote  the  lowest  resource  level  e  such 
that  there  exists  a  feasible  allocation  that  delivers  the  distribution  of  utility  entitlements  ip — the 
efficiency  problem  studied  by  Atkeson  and  Lucas  (1992)  which  is  relevant  for  /3  =  j3. 

A  social  optimum  maximizes  the  average  social  welfare  function  (2),  weighed  by  ip,  over  all 
feasible  allocations.  That  is,  the  social  planning  problem  given  an  initial  distribution  of  entitlements 
ip  and  an  endowment  level  e  is  to  maximize 


U({cvt},a*,P)diP(v) 

subject  to  v  =  U{{cvt},a*;P)  >  U{{cvt},a; (3)  for  all  v,  and 

/^cf(9')Pr(«')#)<e         £  =  0,1,...  (5) 

J     el 

Our  social  planning  problem  is  well  defined,  with  a  non-empty  constraint  set,  for  all  e  >  e*(ip);  we 
are  interested  in  situations  with  J3  >  (3  and  where  e  >  e*(ip). 

Steady  States.  Our  focus  is  on  distributions  of  utility  entitlements  ip  such  that  the  solution 
to  the  planning  problem  features,  in  each  period,  a  cross-sectional  distribution  of  continuation 
utilities  vt  that  is  also  distributed  according  to  ip.  We  also  require  the  cross-sectional  distribution 
of  consumption  to  replicate  itself  over  time.  We  term  any  initial  distribution  of  entitlements  with 
these  properties  a  steady  state  and  denote  them  by  ip*.  As  we  shall  demonstrate  below,  continuation 
utility  constitutes  a  state  variable  that  follows  a  Markov  process,  and  steady  states  are  then  invariant 
distributions  of  this  process. 

Note  that  in  the  Atkeson-Lucas  case,  with  ft  =  j3,  the  non-existence  of  a  steady  state  with 
positive  consumption  is  a  consequence  of  the  immiseration  result:  starting  from  any  non-trivial 
initial  distribution  ip  and  resources  e*(ip)  the  sequence  of  distributions  converges  weakly  to  the 
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distribution  having  full  mass  at  misery,  with  zero  consumption  for  everyone.  We  seek  non-trivial 
steady  states  ip*  that  exhaust  a  strictly  positive  aggregate  endowment  e  in  all  periods. 

3     A  Bellman  Equation 

In  this  section  we  study  a  relaxed  version  of  the  social  planning  problem  whose  solution  coincides 
with  that  of  the  original  problem  at  steady  states.  The  relaxed  problem  has  two  important  advan- 
tages. First,  the  relaxed  problem  can  be  solved  by  studying  a  set  of  subproblems — one  for  each 
dynasty  with  entitlement  v — which  avoids  the  need  to  keep  track  of  the  entire  population.  Second, 
each  of  these  subproblems  admits  a  simple  recursive  formulation,  which  can  be  characterized  quite 
sharply.  We  believe  that  the  general  approach  we  develop  here  may  be  useful  in  other  contexts. 

Consider  the  relaxed  planning  problem  where  the  sequence  of  resource  constraints  (5)  is  replaced 
by  the  single  intertemporal  condition 

/oo  oo 

^QtJ>^)Pr(^)^)<eX>,  (6) 

for  some  positive  sequence  {Qt}  with  Ym^o  Qt  <  °°-  One  can  interpret  this  problem  as  representing 
a  small  open  economy  facing  intertemporal  prices  {Qt}-  The  relaxed  and  original  versions  of  the 
planning  problem  are  related  in  that  any  solution  to  the  former  which  happens  to  satisfy  the 
resource  constraints  in  (5)  must  also  be  a  solution  to  the  latter.  A  Lagrangian  argument  establishes 
the  converse:  there  must  exist  some  positive  sequence  {Qt}  such  that  the  solution  to  the  original 
planning  problem  also  solves  the  relaxed  problem.  Most  importantly,  any  steady-state  solution  to 
the  relaxed  problem  is  a  steady-state  solution  to  the  original  one. 

Our  focus  on  steady  states  leads  naturally  to  Qt  =  ql  for  some  q  >  0.  Indeed,  steady  states 
are  only  compatible  with  q  =  $,  so  we  adopt  this  value  for  the  relaxed  problem  from  this  point 
forward.  Attaching  a  multiplier  A  >  0  to  the  intertemporal  resource  constraint  (6),  we  can  form 
the  Lagrangian  L  =  J  Lv  dijj(v)  where 


oo 


and  study  the  optimization  of  L  subject  to  v  =  U {{$} ,  a* ;  /?)  >  U {{cvt} ,  a; (3)  for  all  v.  This  is 
equivalent  to  the  pointwise  optimization,  for  each  v,  of  the  subproblem:  k(v)  =  sup  Lv  subject  to 
v  =  U({tf},  a*;/3)  >  U{{cvt),  a;  /?).  Our  first  result  characterizes  this  value  function  and  shows  that 
it  satisfies  a  Bellman  equation. 

Theorem  1   The  value  function  k(v)  is  continuous,  concave,  and  satisfies  the  Bellman  equation 

k(v)  =  maxE[6u(0)  -  \c(u(6))  +  0k(w(9))]  (7) 


subject  to 

v  =  E[9u(6)  +  f3w(9)]  (8) 

9u(9)  +  f3w(9)  >  9u(9')  +  pw{9')        for  all    9, 9'  e  0.  (9) 

This  recursive  formulation  imposes  a  promise-keeping  constraint  (S)  and  an  incentive  con- 
straint (9).  Intuitively,  the  latter  rules  out  one-shot  deviations  from  truth-telling,  guaranteeing 
that  telling  the  truth  today  is  optimal  if  the  truth  is  told  in  future  periods.  Of  course,  this  is 
necessary  to  satisfy  the  full  incentive-compatibility  condition  (4).  Intuitively,  the  rest  is  implicitly 
taken  care  of  in  (7)  by  evaluating  the  value  function  at  the  continuation  utility:  for  any  given 
continuation  value  w(9),  envision  the  planner  in  the  next  period  solving  the  remaining  sequence 
problem  by  selecting  an  entire  allocation  that  is  incentive  compatible  from  then  on.  Then  k(w(9)) 
represents  the  value  to  the  planner  of  this  continuation  allocation.  Taken  together,  a  pair  u(9)  and 
w{9)  that  satisfies  (8)-(9)  pasted  with  the  corresponding  continuation  allocations  for  each  w(9), 
yields  an  allocation  that  satisfies  the  full  incentive-compatibility  (4).  The  objective  function  in  (7) 
then  captures  the  relevant  value  of  allocations  constructed  in  this  way. 

Among  other  things,  Theorem  1  shows  that  the  maximum  in  the  Bellman  equation  (7)  is  at- 
tained. We  let  the  policy  functions  gu(9,v)  and  gw(9,v)  denote  the  unique  solutions  for  u  and  w, 
respectively.  For  any  initial  utility  entitlement  Vq,  an  allocation  {ut}  can  then  be  generated  from 
the  policy  functions  (gu,gw)  by  setting  uq{9q)  =  gu(90,VQ)  initially  and  defining  iit(0')  and  vt+i(9t) 
inductively  for  t  >  1  by  ut(0')  =  gu{9t,vt{9t-1))  and  Vt+^O*)  =  gw(9uvt{9t-1)). 

Our  next  result  elucidates  the  connection  between  allocations  generated  from  the  policy  functions 
in  this  way  and  solutions  to  the  planning  problem. 

Theorem  2  (a)  An  allocation  {ut}  is  optimal  for  the  relaxed  problem,  given  vq,  if  and  only  if  it 
is  generated  by  the  policy  functions  (gu,gw)  starting  at  vq,  is  incentive  compatible,  and  delivers  a 
lifetime  utility  of  Vq;  (b)  an  allocation  {ut}  generated  by  the  policy  functions  (gu,gw),  starting  at 
vq,  has  lmij-.oo/?' E_i?;t(#4-1)  =  0  and  delivers  utility  vq;  (c)  an  allocation  {ut}  generated  by  the 
policy  functions  {gu,gw),  starting  from  vQ,  is  incentive  compatible  if 

limsup  E_1ptvt(at-1{9t-1))  >  0 

for  all  reporting  strategies  a. 

Part  (a)  of  Theorem  2  implies  that  either  the  solution  to  the  relaxed  planning  problem  is 
generated  by  the  policy  functions  of  the  Bellman  equation,  or  there  is  no  solution  at  all.  Parts  (b) 
and  (c)  of  the  theorem  show  that  the  first  case  is  guaranteed  if  we  can  verify  the  limit  condition  in 
part  (c).  The  latter  is  automatically  satisfied  for  all  utility  functions  that  are  bounded  below  and 
can  be  verified  in  many  other  cases  of  interest.0 


a Theorem  2  involves  various  applications  of  versions  of  the  Principle  of  Optimality.   For  example,  for  any  given 
policy  functions  [gu,gw)  and  an  initial  value  vo,  the  individual  dynasty  faces  a  recursive  dynamic  programming 


The  case  with  (3  =  J3  can  be  studied  by  the  same  approach.  Recall  that  the  efficiency  problem 
studied  by  Atkeson  and  Lucas  (1992)  minimizes  resources  e  subject  to  the  sequence  of  resource 
constraints  (5)  and  v  =  U({cvt),  a*;  (3)  >  U  {{cvt} ,  a;  (3)  for  all  v.  Consider  the  relaxed  version 
of  this  problem  that  replaces  the  sequence  of  resource  constraints  with  the  single  intertemporal 
constraint  (G)  for  some  sequence  {Qt}-  Then  if  the  solution  to  this  problem  satisfies  the  resource 
constraints  (5)  it  is  also  a  solution  to  the  original  problem.  Although  no  steady  state  exists  in 
this  case,  with  constant  relative  risk  aversion  utility  functions,  the  relaxed  problem  with  Qt  =  qAL 
characterize  the  original  one,  for  an  appropriately  chosen  value  of  qAL  >  0,  not  necessarily  equal  to 
P. 

Since  the  constraint  (6)  binds,  we  can  take  the  objective  function  for  the  relaxed  problem  as 
the  left  hand  side  of  this  inequality.  This  minimization  can  then  be  done  pointwise:  for  each  v  let 
KAL(v)  =  inf  J2Zo  <&&(&)  subject  to  v  =  U ({<*},  a*;  P)  >  U{{ct},a;  /?).  The  associated  Bellman 
equation  for  this  problem  is  then 

KAL(v)  =  mmE[c(u{d))+qALKAL(w(8))]  (10) 

subject  to  (8)  and  (9).  This  problem  can  be  thought  of  as  the  limiting  version  of  the  0  >  (3  case  as 
A  — >  co  and  where  the  discount  factor  in  the  objective  qAL  is  not  necessarily  0.  Theorems  1  and  2 
also  apply  to  this  problem  and  its  Bellman  equation. 

4      Optimal  Inequality 

In  this  section  we  exploit  the  connection  between  the  Bellman  equation  and  the  planning  problem. 
We  characterize  the  solution  and  derive  a  key  equation  that  illustrates  mean-reverting  forces  in  the 
dynamics  of  consumption.  The  main  result  of  the  section  is  to  establish  that  these  forces  are  strong 
enough  to  imply  the  existence  of  an  invariant  distribution  with  no  misery.  Finally,  we  provide 
sufficient  conditions  to  verify  part  (c)  of  Theorem  2,  and  ensure  that  a  solution  to  the  planning 
problem  exists. 

Mean  Reversion 

We  are  now  in  a  position  to  study  the  Bellman  equation's  optimization  problem.  To  begin,  we 
justify  the  use  of  first-order  conditions  with  the  following  lemma: 

Lemma  1  The  value  function  k(v)  is  strictly  concave  and  differ ~entiable  on  the  interior  of  its  do- 
main, with  limv^vk'(v)  =  -co.    If  utility  is  unbounded  below,  then  limv^yk'(v)  =  1.    Otherwise 


problem  with  state  variable  vt.  Conditions  (8)  and  (9)  then  amount  to  guessing  and  verifying  a  solution  to  the 
Bellman  equation  of  the  agent's  problem — in  particular,  that  the  value  function  that  satisfies  the  Bellman  equation, 
with  truth  telling,  is  the  identity  function.  However,  one  also  needs  to  verify  that  this  value  function  represents  the 
true  optimal  value  for  the  dynasty  from  the  sequential  problem.  This  verification  is  accomplished  by  part  (c)  of 
Theorem  2. 
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limv^y  k'(v )  =  oo. 

Let  A  =  k'(v)  be  the  multiplier  on  the  left-hand  side  of  the  promise-keeping  constraint  (8)  and 
let  jjl{9,9')  be  the  multipliers  on  the  incentive  constraints  (9).  The  first-order  condition  for  u(9)  is 

-  \c' (u(9)))p(6)  -  9Xp(9)  +  ]T  9(i(9, 9')  -  £  9^(9, 9')  <  0, 


with  equality  if  u  (9)  is  interior.  The  solution  for  w(9)  must  be  interior,  given  the  Inada  conditions 
for  k(v)  derived  in  Lemma  1,  and  must  satisfy  the  first-order  condition 

pk'  [w(9))p(9)  -  0Xp{9)  +p$r  p(9,  ff)-PY,  M  °')  =  °- 

e'  e' 

Using  the  envelope  condition  k'(v)  —  A  and  adding  up  across  9,  this  becomes 

YJk'{gw(9,v))p{9)  =  ik\v).  (11) 

eeo  P 

This  key  equation  can  be  represented  in  sequential  notation  as 

Et-l[k'{vt+l(9t))]=ik'(vt(9t-1))  (12) 

where  {vt}  is  generated  by  the  policy  function  gw.  Thus,  {k'(vt)}  is  a  Conditional  Linear  Auto 
Regressive  (hereafter:  CLAR)  Markov  process.  Note  that  we  can  translate  anything  about  the 
process  {k'(vt)}  into  implications  for  the  process  {vt},  since  the  derivative  k'(v)  is  continuous  and 
strictly  decreasing.  Likewise,  using  the  policy  function  gu(9,v),  conclusions  about  the  process  {vt} 
provide  information  about  the  process  for  consumption. 

The  conditional  expectation  in  (12)  illustrates  that  @/0  <  1  creates  a  force  for  mean  reversion 
for  the  process  {k'(vt)}  toward  zero.  Lemma  1  implies  that  the  value  function  k(v)  has  an  interior 
maximum  at  v*  >  v  with  k'(v*)  =  0,  so  reversion  occurs  towards  this  interior  utility  level — away 
from  misery.  This  feature  is  key  to  our  results  on  the  existence  of  invariant  distributions. 

Economically,  the  mean-reversion  equation  itself  embodies  an  interesting  form  of  social  mobility. 
We  can  divide  the  population  into  two  social  hierarchies,  with  mobility  ensured  between  them. 
Descendants  of  individuals  with  current  welfare  above  v*  will  eventually  fall  below  it.  Similarly 
dynasties  initially  entitled  to  welfare  below  v*  are  guaranteed  to  access  levels  above  it.  This  rise 
and  fall  of  families  illustrates  a  strong  intergenerational  mobility  in  the  model. 

In  deriving  this  result,  it  is  important  to  stress  the  role  played  by  the  non-monotonicity  of 
the  value  function  k(v).  Although  mean  reversion  stems  from  ft/ft  <  1  in  equation  (12),  it  is 
non-monotonicity  of  k(v)  that  ensures  that  reversion  is  not  toward  misery.  By  contrast,  in  the 
Atkeson-Lucas  case  a  CLAR  equation  similar  to  (12)  may  hold,  but  the  value  function  in  this 
case  is  monotone  and  reversion  then  occurs  toward  misery.    Indeed,  the  envelope  and  first-order 
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conditions  for  the  Bellman  equation  (10)  yield 

e  qAL 

which  is  similar  to  condition  (12)  when  qAL  >  j3.h  Crucially,  unlike  the  case  with  ft  >  /?,  here 
the  value  function  KAL(v)  is  strictly  increasing,  so  that  KAL(v)  >  0.  Thus,  {KAL{vt)}  is  a  non- 
negative  process,  which  implies  by  the  Martingale  Convergence  Theorem  that  it  must  converge 
almost  surely  (a.s.)  to  some  finite  value.  Since  incentives  must  be  provided  using  continuation 
utilities  gw(9,v)  ^  gw(9,v),  this  rules  out  anything  other  than  KAL(vt)  — »  0  a.s.  Immiseration  then 
follows,  vt  — >  v  and  q  — >  0  a.s.  This  highlights  the  importance  of  the  non-monotonicity  of  the  value 
function  k(v)  for  our  results  in  the  case  of  $  >  (3. 

Our  next  result  pushes  the  characterization  of  reversion  past  the  average  behavior  of  the  {k't} 
process  by  deriving  bounds  for  its  evolution.  These  bounds  are  critical  for  guaranteeing  the  existence 
of  an  invariant  distribution  with  no  mass  at  misery. 

Proposition  1   The  policy  function  gw  (9,  v)  satisfies  the  CLAR  equation  (11).  In  addition: 

(a)  if  utility  is  unbounded  below,  then 

7(1  -  fc»)  +  (l  -  0  <  1  -  *V(0,  v))  <  7(1  -  k'(v))  +  (l  -  £)  (13) 

for  all  6  G  9,  where  the  constants  are  given  by  7  =  (/?//?)  max  {(1  +  6n  —  E[6  <  6n])/9n}  and 
7  =  (P/P)  min  {1  +  9n^  -E[9\9>  9n]/9n_l}. 

—  2<n<N 

(b)  if  utility  is  bounded  below,  then  for  low  enough  values  of  v  such  that  k'(v)  >  1,  we  have 

u(6)    =    u 
w(9)    >    v 
k'(w(9))     =    0/P)k\v) 

for  all  9  e  6.   For  values  of  v  such  that  k'(v)  <  1,  the  lower  bound  in  (13)  holds;  the  upper 
bound  in  (13)  holds  for  sufficiently  high  v. 

Proposition  1  illustrates  a  powerful  tendency  away  from  misery.  For  example,  with  utility 
unbounded  below,  continuation  utility  gw(9,  v)  remains  bounded  even  as  v  — ►  -00.  Thus,  no  matter 
how  much  a  parent  is  supposed  to  be  punished,  his  child  is  always  somewhat  spared. 

6With  logarithmic  utility  qAL  =  /3  yields  a  solution  with  constant  average  consumption.  With  u(c.)  =  cl~a /{I  -a) 
and  a  <  1  the  appropriate  value  of  qAh,  that  yields  constant  consumption,  is  strictly  above  (3. 
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Main  Result:  Existence  of  an  Invariant  Distribution  with  No  Misery 

We  now  state  the  main  result  of  this  section:  if  a  solution  to  the  relaxed  planning  problem  exists, 
then  it  admits  an  invariant  distribution  with  no  misery.  The  proof  of  this  result  relies  on  the 
conditional-expectation  equation  (12)  and  the  bounds  in  Proposition  1.  Thus,  it  makes  use  of  both 
mean-reversion  properties  discussed  in  the  previous  subsection.7 

Proposition  2  The  existence  of  an  invariant  distribution  ip*  with  no  mass  at  misery,  ip*({v})  =  0, 
for  the  Markov  process  {vt}  implied  by  gw  is  guaranteed  if  either:  utility  is  unbounded  below,  utility 
is  bounded  above,  or  7  <  1 . 

Proposition  2,  when  combined  with  part  (a)  of  Theorem  2,  leaves  open  only  two  possibilities:  (i) 
the  relaxed  problem  admits  a  steady-state  invariant  distribution  with  no  misery;  or  (ii)  no  solution 
exists.  This  situation  contrasts  strongly  with  the  Atkeson-Lucas  case,  with  /?  =  /?,  where  a  solution 
exists  but  does  not  admit  a  steady  state,  and  everyone  ends  up  at  misery.  Towards  the  end  of  this 
section  we  show  that  a  solution  to  the  planning  problem  can  be  guaranteed  so  that  case  (i)  holds. 

Our  Bellman  equation  also  provides  an  efficient  method  for  explicitly  solving  the  planning  prob- 
lem. We  illustrate  this  with  two  examples,  one  analytical  and  another  numerical. 

Example  1.  Suppose  utility  is  CRRA  with  a  =  1/2,  so  that  u(c)  =  4c1/2  for  c  >  0  and  c(u)  =  u2/2 
for  u  >  0.  For  f3  =  (3  Atkeson-Lucas  show  that  the  optimum  involves  consumption  inequality 
growing  without  bound  and  leading  to  immiseration. 

Consider  the  relaxed  problem  where  we  ignore  the  non-negativity  constraints  on  u  and  w, 

k(v)  =  maxE[0u(0)  -  \u(6)2  +  J3k(w(6))] , 

u,w 

subject  to  (8)  and  (9).  This  is  a  linear-quadratic  dynamic  programming  problem,  so  it  follows  that 
the  value  function  is  a  quadratic  and  the  policy  functions  are  linear  in  v: 

gu(6,v)     =    7?(0)t;  +  7«(0) 
g"(0tv)    =    7»  +  7oW 

For  taste  shocks  with  sufficiently  small  amplitude  we  can  guarantee,  by  continuity  with  the  deter- 
ministic case  0  =  0,  that  7™(0)  <  1  and  7™(0)  >  0,  implying  a  unique  bounded  ergodic  set  for  utility 
[vl,vh]  with  vL  >  0.  Moreover,  gu(6,v)  >  0  for  v  e  [vl,vh].  Hence,  since  the  planning  problem  is 
convex  and  utility  turns  out  to  be  strictly  positive  at  the  steady  state,  this  solution  does  solves  the 
original  problem  with  non- negativity  constraints  on  u. 


7 When  utility  is  bounded  below,  we  either  require  that  utility  be  bounded  above,  or  that  7  <  1,  which  is  ensured 
for  a  small  dispersion  of  the  shocks,  as  a  simple  way  of  ensuring  that  the  ergodic  set  is  bounded  away  from  misery. 
It  seems  very  plausible,  however,  that  these  conditions  could  be  dispensed  with. 
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Figure  1:  Policy  functions  gw(6,v) 


Figure  2:  Policy  function  gc(9.w) 


Example  2.  To  illustrate  the  numerical  value  of  our  recursive  formulation,  we  compute  the  solution 
for  the  logarithmic  case  with  j3  =  0.9,  J3  =  0.975,  e  =  A"1  =  0.6,  6h  =  1.2,  0,  =  0.75  and  p  =  0.5. 
We  iterated  on  the  Bellman  equation  for  k(v)  until  convergence.8 

Figure  1  plots  the  policy  function  for  continuation  utility  in  consumption-equivalent  units, 
c(v(l  —  (3))  against  c(gw(9,v)(l  —  (3)),  while  Figure  2  does  the  same  for  the  policy  function  for 
consumption,  c(v(l  — /?))  against  gc(9,  v).  Both  policy  functions  are  monotonic  and  smooth.  Figure 
1  reveals  a  unique,  bounded  ergodic  set  for  v.  Note  that  both  policy  functions  become  nearly  flat  for 
low  values  of  v.  This  illustrates  the  result,  discussed  immediately  after  Proposition  1,  that  utility 
is  kept  above  some  endogenous  bound. 

Figure  3  displays  the  steady-state,  cross-sectional  distribution  of  dynastic  utility  measured  in 
consumption-equivalent  units,  c(v(l  —  /?))  implied  by  the  solution  to  the  planning  problem.9  The 
long-run  distribution  has  a  smooth  bell-curve  shape — a  feature  that  must  be  due  to  the  smooth, 
mean-reverting  dynamics  of  the  model,  since  it  cannot  be  a  direct  consequence  of  our  two-point 
distribution  of  taste  shocks.  The  figure  also  shows  the  invariant  distributions  for  various  values  of 


8The  details  of  this  numerical  exercise  where  as  follows:  we  solved  for  u  (0)  as  a  function  of  w  (9)  using  the 
incentive  and  promise- keeping  constraints.  We  then  maximized  over  w(6).  We  employed  a  grid  for  v  defined  in 
terms  of  equally  spaced  consumption-equivalent  units  c((l  —  0)v)  =  {0.01, . . .  ,2}.  Results  with  a  grid  size  of  100 
and  300  were  similar;  we  report  the  latter.  We  used  Matlab's  splines  package  to  interpolate  the  value  function 
and  used  fmincon.m  as  our  optimization  routine  over  w{6).  Our  iterations  were  initialized  with  the  value  function 
corresponding  to  the  feasible  plan  that  features  constant  consumption: 


kQ(v) 


(,(1  _  p)  _  Ac(t;(i  _  ,3)) 
1-0 


We  stopped  the  iterations  when  ||fcn(u)  —  fcn_i(v)||  <  10~10  and  verified  that  the  policy  functions  had  also  converged. 
Note  that  gw(9,v)  is  well  within  the  interior  of  [.01,2],  so  that  the  arbitrary  upper  and  lower  bounds  from  our  grid 
choice  were  not  found  to  be  binding. 

9The  invariant  distribution  was  approximated  by  generated  a  Monte  Carlo  simulation  for  the  dynamics  of  the 
{vt}  process  generated  by  g1" ,  with  an  arbitrary  initial  value  of  vq.  Since  this  process  converges  to  a  unique  invariant 
distribution  ip* ,  starting  from  any  initial  value  of  vo,  the  frequencies  in  a  long  time-series  sample  approach  the 
frequencies  of  ip" .  To  create  the  figure  we  used  Matlab's  Wavelet  Toolbox  to  approximate  the  density  from  the 
simulated  Monte-Carlo  sample. 
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utility  in  consumption  equivalent  units 


Figure  3:  Steady-State  Distributions  of  Dynastic  Utility 

P.  The  degree  of  inequality  appears  to  decrease  with  higher  values  of  /?.  This  outcome  is  suggested 
intuitively  by  the  coefficient  on  the  CLAR  equation  (12)  and  the  discussion  in  Section  5  on  features 
of  the  impulse  response  to  shocks.  These  simulations  also  support  the  natural  conjecture  that  as 
we  approach  the  Atkeson-Lucas  case,  /3  — >  /3,  the  resulting  sequence  of  invariant  distributions  blows 
up,  since  no  steady  state  with  positive  consumption  exists  when  $  =  P- 

We  now  turn  briefly  to  issues  of  uniqueness  and  stability  for  the  invariant  distribution  guaranteed 
by  Proposition  2.  This  question  is  of  economic  interest  because  it  represents  an  even  stronger  notion 
of  social  mobility  than  that  implied  by  the  mean- reversion  condition  (12)  discussed  in  the  previous 
subsection.  That  is,  if  convergence  toward  the  distribution  i/j*  occurs  starting  from  any  initial  utility 
level  vq,  then  the  fortunes  of  distant  descendants — the  distribution  of  their  welfare — is  independent 
of  the  individual's  present  condition.  At  the  optimum,  the  past  always  exerts  some  influence  on  the 
present,  but  its  influence  is  bounded  and  dies  out  over  time,  so  that  the  advantages  or  disadvantages 
of  distant  ancestors  are  eventually  wiped  out. 

Indeed,  under  some  conditions  we  can  guarantee  that  the  social  optimum  in  our  model  does 
display  this  strong  notion  of  social  mobility.  To  see  this,  suppose  the  ergodic  set  for  the  {A;^} 
process  is  compact.  This  is  guaranteed,  for  example,  by  applying  Proposition  1  when  7  <  1.  Then, 
if  the  policy  function  gw(9,v)  is  monotone  in  v,  the  invariant  distribution  ip*  is  unique  and  stable 
in  the  sense  that,  starting  from  any  initial  distribution  ipQ,  the  sequence  of  distributions  {ipt}, 
generated  by  gw,  converges  weakly  to  ip*  ■  This  follows  since  the  conditional-expectation  equation 
(12)  ensures  enough  mixing  to  apply  Hopenhayn-Prescott's  Theorem.10   The  monotonicity  of  the 


'See  pg.  382-383  in  Stokey  and  Lucas  with  Prescott  (1989). 
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policy  functions  for  continuation  utility  w  seems  intuitive  and  plausible,  as  illustrated  by  Examples 
1  and  2.11 

Another  approach  suggests  uniqueness  and  convergence  without  relying  on  monotonicity.  Grun- 
wald,  Hyndman,  Tedesco  and  Tweedie  (1999)  show  that  one-dimensional,  irreducible  Markov  processes 
with  the  Feller  property  that  are  bounded  below  and  satisfy  a  CLAR  condition,  such  as  (12),  have 
a  unique  and  stable  invariant  distribution.  Moreover,  convergence  to  this  distribution  from  any 
initial  distribution  is  fast,  in  the  sense  that  it  occurs  at  the  geometric  rate  @/$.  All  the  require- 
ments of  their  theorem  have  been  verified  already  for  our  model,  except  for  the  technical  condition 
of  irreducibility,  which  is  likely  to  hold  if  we  were  to  assume  that  the  taste  shock  has  a  continuous 
distribution.  We  do  not  pursue  this  formally  other  than  to  note  that  the  forces  for  reversion  in  (12) 
could  be  further  exploited  to  establish  uniqueness  and  convergence. 

Our  focus  on  steady  states,  where  the  distribution  of  utility  entitlements  replicates  itself  over 
time,  has  exploited  the  fact  that  the  relaxed  and  original  planning  problems  must  coincide.  However, 
for  the  logarithmic  utility  case  we  can  do  more  and  characterize  transitional  dynamics. 

Proposition  3  //  utility  is  logarithmic,  then  for  any  initial  distribution  of  utility  entitlements  ip 
there  exists  an  endowment  level  e*(ip)  such  that  the  solution  to  the  original  social  planning  problem 
is  generated  by  the  policy  functions  (gu,gw)  from  the  relaxed  problem  with  Qt  =  ft .  The  function 
e*  is  monotone  increasing,  in  that  if '  ipa  -<  ipb  in  the  sense  of  first- order  stochastic  dominance  then 
e*(i;a)<e*(i;b). 

One  interesting  application  of  this  result  is  to  the  situation  where  the  planning  problem  is 
modified  to  select  the  best  initial  distribution  ip,  instead  of  taking  one  as  given.  Then  all  initial 
dynasties  are  treated  identically  and  started  with  identical  utility  level  v*  solving  k'(v*)  =  0.  The 
optimal  allocation  then  evolves  according  to  the  dynamics  implied  by  the  policy  function  gw(6,v). 
The  cross-sectional  distribution  of  welfare  will  spread  out  from  its  initially  egalitarian  condition  as 
dynasties  experience  varying  luck  in  the  realization  of  their  shocks. 

By  applying  Proposition  3,  convergence  to  a  unique  invariant  distribution  ip*  of  the  Markov 
process  {vt}  implied  by  the  policy  function  gw(9,  v)  takes  on  additional  economic  meaning.  It  implies 
the  stability  of  the  cross-sectional  distributions  of  welfare  and  consumption  in  the  population.  That 
is,  if  the  Markov  process  {vt}  generated  by  gw  is  stable,  then  the  cross-sectional  distributions  of 
welfare  and  consumption  eventually  settle  down  to  the  steady  state. 

As  mentioned  in  Section  3,  for  any  utility  function  specification  one  can  characterize  the  solution 
for  any  (ip,e)  as  the  solution  to  a  relaxed  problem  with  some  sequence  of  prices  {Qt},  that  are  not 
necessarily  exponential.  Proposition  3  identifies  the  distributions  and  endowment  pairs  (ip,  e)  that 
lead  to  exponential  prices  in  the  logarithmic  case.  More  generally,  with  logarithmic  utility  for  any 
pair  (ip,e),  we  can  show  that  Qt  =  ft  +  A/5*  for  some  constant  A.  The  entire  optimal  allocation 
can  then  be  characterized  by  the  policy  functions  from  a  non-stationary  Bellman  equation.  Since 


nIndeed,  for  the  general  case  it  can  be  shown  that  gw(9,v)  is  strictly  increasing  in  v.  However,  although  we  know 
of  no  counterexample,  we  have  not  found  conditions  that  ensure  the  monotonicity  of  gw{9,  v)  for  all  6^0. 
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prices  are  asymptotically  exponential,  in  that  limt_00  P~lQt  =  1,  it  follows  that  long-run  dynamics 
are  always  dominated  by  the  policy  functions  (gu,gw)  from  the  relaxed  problem  with  exponential 
prices  Qt  =  /?*  that  we  have  characterized. 

Sufficient  Conditions  for  Verification 

We  conclude  this  section  by  describing  sufficient  conditions  for  a  solution  to  the  planning  problem 
to  exist  at  the  steady  state  ip*  identified  by  the  policy  functions  in  Proposition  2.  This  involves  two 
steps.  First,  we  establish  that  allocations  generated  by  the  policy  functions  are  indeed  incentive 
compatible  by  verifying  the  condition  in  part  (c)  of  Theorem  2.  Second,  we  verify  that  average 
consumption  is  finite  under  the  invariant  distribution  ip*. 

Lemma  2  The  allocation  generated  from  the  policy  functions  (gu,gw),  starting  from  any  vq,  is 
guaranteed  to  be  incentive  compatible  in  the  following  cases:  (a)  utility  is  bounded  above;  (b)  utility 
is  bounded  below;  (c)  utility  is  logarithmic;  or  (d)  7  <  1  or  7  >  0. 

We  now  find  sufficient  conditions  that  guarantee  average  consumption  is  finite  under  the  invariant 
distribution  ip* .  If  the  ergodic  set  for  utility  v  is  bounded  away  from  the  extremes,  then  consumption 
is  bounded  and  average  consumption  is  trivially  finite.  Even  when  a  bounded  ergodic  set  for  utility 
v  cannot  be  ensured,  finite  average  consumption  can  be  guaranteed  for  a  large  class  of  utility 
functions. 

Lemma  3  Average  consumption  is  finite  under  the  invariant  distribution  ip* 

^Tc(gu(8,v))p(9)diP*(v)<cx> 
e 

if  either  (a)  the  ergodic  set  for  v  is  bounded;  or  (b)  utility  is  such  that  d{u{c))  is  a  convex  function 
of  c. 

Note  that  a  bounded  ergodic  set  is  guaranteed  by  7  <  1,  which  is  ensured  for  taste  shocks 
with  sufficiently  small  amplitude;  and  condition  (b)  holds,  for  example,  for  all  constant  relative  risk 
aversion  utility  functions  with  a  >  1. 

The  value  of  average  consumption  depends  on  the  value  of  A.  For  instance,  in  the  case  of 
constant  relative  risk  aversion  utility,  average  steady  state  consumption  is  a  power  function  of  A, 
and  thus  has  full  range.  In  fact,  in  this  case  the  entire  solution  for  consumption  is  homogenous 
of  degree  one  in  the  value  of  the  endowment  e.  This  ensures  a  steady  state  solution  to  the  social 
planning  problem  for  any  endowment  level. 

5      Discussion:  Mean-Reversion 

This  section  develops  an  intuitive  understanding  of  the  key  mean-reversion  property  discussed  pre- 
viously. We  first  derive  the  impulse  response  of  consumption  to  a  one-time  taste  shock.   We  then 
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revisit  the  full  problem  with  an  alternative  Bellman  equation  that  is  useful  as  a  source  of  intuition. 

Impulse  Response 

Consider  a  version  of  our  model  where  only  the  first  generation  faces  uncertainty.  In  the  first 
period,  there  are  two  possible  values  for  the  taste  shock  60  G  {0l,9h},  but  thereafter  the  economy 
is  deterministic:  6t  =  1  for  t  >  1.  We  compare  this  to  the  case  with  no  uncertainty  in  the  first 
period.  This  allows  us  to  trace  out  the  consumption  response  to  the  taste  shock  over  time.  To 
simplify,  we  adopt  logarithmic  utility. 

We  begin  by  studying  a  subproblem  of  the  deterministic  planning  problem  from  the  second 
generation  onward,  that  is  for  t  =  1,2, —  For  a  given,  promised  continuation  utility  Vj,  the 
planning  problem  is 

oo 

fcdetfa)  =  maxV^'"1  (logQ  -  Ac*)  , 
{a}  7T  v  ' 


t=\ 


4=1 


subject  to 

oo 

The  associated  Bellman  equation  is 

kdet(vt)  =  max  (log(ct)  -  Aq  +  (3kdet(vt+1) 

Ct,Vt+l     \  I 

subject  to  vt  =  log Ct  +  pvt+j.  The  first-order  and  envelope  conditions  imply  that 

*4etK+l)      =      !*£*(*)>  (14) 

ct    =    A-^l-fc^M).  (15) 

Condition  (14)  shows  that  {k'det(vt)}  reverts  geometrically  towards  zero  at  the  rate  (3/$.  This  is 
a  deterministic  version  of  the  conditional-expectation  equation  (12).  In  the  logarithmic  case,  it 
translates  directly  into  consumption  by  the  first-order  condition  (15).  Thus,  consumption  reverts 
back  to  a  common  steady  state  at  the  same  rate;  deviations  from  the  steady-state  level  of  consump- 
tion have  a  half-life  of  (log2(/3 / P))~l ■  Note  that  in  the  Atkeson-Lucas  case  when  social  and  private 
discounting  coincide,  so  that  P  =  $,  consumption  remains  perfectly  constant  after  the  shock  at  its 
new  level  ct  =  c{y-y/{\  —  /?)). 

Turning  to  the  first  generation  at  t  =  0,  the  planning  problem  solves 


maxE[0olog  (co(0))  -  Aocq(0)  +  pkdet(Vl{8))} 


co,vi 


subject  to 


0l  log  (c0(9L))  +  pVl(9L)  >  6L  log  (c0(M)  +  Pv1  (9H). 
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where  we  omit  the  other  incentive  constraint  since  it  is  not  binding  at  the  optimum,  and  the  problem 
is  convex.  At  the  optimum,  c0{6H)  >  c0(9L),  Vi(8H)  <  Vi(6L),  and 

E[k'det(v1(6))]=0, 

implying  that  Vi(6H)  <  v*  <  Vi(9L)  where  k'{v*)  =  0.  Note  that  average  consumption  is  constant 
and  equal  to  A-1  in  all  periods. 

Figure  4  shows  the  consumption  response  to  a  taste  shock  in  the  first  period,  for  subsequent 
periods.  That  is,  we  use  (14)  and  (15)  for  t  >  1  starting  at  v1(9L),  v*  and  v±(6H).  The  effect 
on  consumption  from  the  shock  dies  out  over  time  and  consumption  returns  to  a  common  steady- 
state  level.  Again,  this  illustrates  that  the  influence  of  past  fortunes  eventually  vanishes  for  distant 
descendants.  We  also  plot  the  Atkeson-Lucas  case  with  (3  =  (3,  where  the  luck  of  the  first  generation 
has  a  permanent  impact  on  the  consumption  of  all  descendants. 


Figure  4:  Consumption  path  for  t  >  1  in  response  to  taste  shock  at  t  =  0 

To  provide  incentives  for  the  first  generation,  society  rewards  the  descendants  of  an  individual 
reporting  a  low  taste  shock.  Rewards  can  take  two  forms  and  society  makes  use  of  both.  The 
first  is  standard  and  involves  increased  consumption  spending,  in  present- value  terms.  The  second 
is  more  subtle  and  exploits  differences  in  preferences:  it  allows  an  adjustment  in  the  pattern  of 
consumption,  for  a  given  present  value,  in  the  direction  preferred  by  individuals.12 

Since  individuals  are  more  impatient  than  the  planner,  this  latter  form  of  reward  is  delivered 
by  tilting  the  consumption  profile  toward  the  present.    Similarly,  punishments  involve  tilting  the 


12Somc  readers  may  recognize  this  last  method  as  the  time-honored  system  of  rewards  and  punishments  used  by 
parents  when  conceding  their  child's  favorite  snack  or  reducing  their  TV-time.  In  these  instances,  the  child  values 
some  goods  more  than  the  parent  wishes,  and  the  parent  uses  them  to  provide  incentives. 
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consumption  path  toward  the  future.  In  both  cases,  earlier  consumption  dates  are  used  more 
intensively  to  provide  incentives — rewards  and  punishments  are  front-loaded.  Indeed,  consumption 
returns  to  a  common  steady-state  level  in  the  long-run  regardless  of  the  initial  shock  because 
affecting  the  consumption  of  very  distant  descendants  is  not  an  efficient  way  for  society  to  provide 
incentives  to  the  first  generation. 

Another  Bellman  Equation 

Here  we  develop  another  Bellman  equation  that  holds  for  any  value  of  q  not  necessarily  equal  to  J3. 
This  alternative  formulation  is  useful,  both  as  a  source  of  intuition  and  to  motivate  our  focus  on 

q  =  P- 

Consider  the  following  cost  minimization  problem 

oo 

*(«,«)=  mm  ^g^c^PrC*1), 

(C'}    t=0  gt 

subject  to  the  incentive  compatibility  constraint  U ({ct} ,  a* ;  (3)  >  U({ct},a;  (5)  and 

oo 

4=0 
oo 

t=0 

that  is,  delivering  utility  v  and  v  for  the  planner  and  individual,  respectively.  Then  the  value 
function  must  satisfy  the  Bellman  equation 

K{v,v)  =  maxE\c(u(9))+qK(w(9),w{9))], 

u,w,u>  -* 

subject  to 

v    =    E[6u(6)  +  J3w(6)] 

v    =    E[6u(6)  +  0w(6)] 

and 

6u{9)  +  f3w{9)  >  9u{6')  +  0w{6')        for  all  9, 9'  e  0. 

This  formulation  could  be  used  to  derive  all  of  our  results,  although  the  lower-dimensional  Bellman 
equation  (7)  is  slightly  more  convenient  for  that  purpose.  The  advantage  of  this  cost-minimization 
formulation,  however,  is  that  it  lends  itself  naturally  to  economic  interpretations. 

The  following  story  provides  a  useful  reinterpretation  and  source  for  intuition.  Consider  an 
infinite-lived  household  with  two  members,  husband  and  wife,  and  assume  that  consumption  is 
a  public  good — there  is  no  intra-period  resource  allocation  problem.   However,  husband  and  wife 
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disagree  on  how  to  discount  the  future.  Suppose  the  wife  is  more  patient,  but  only  the  husband 
can  observe  and  report  taste-shock  realizations. 

Then  this  cost-minimization  problem  characterizes  the  constrained  Pareto  problem  for  this 
household,  in  the  sense  that  the  isocost  curve  K(v,  v)  =  K0  represents,  given  resources  K0,  the 
Pareto  frontier  between  husband  and  wife.  The  Pareto  frontier  is  non-standard  in  that  it  is  not 
everywhere  decreasing  and  does  not  represent  the  usual  transfer  of  private  goods  between  two 
agents.  Instead,  it  arises  from  differences  in  preferences  that  generate  a  disagreement  about  the 
optimal  consumption  path  for  the  only  public  good.  Since  disagreement  on  preferences  is  bounded, 
the  Pareto  frontier  is  non-monotone  and  the  highest  possible  utility  for  the  wife  is  attained  for  an 
interior  utility  level  for  the  husband,  where  Ki(v*,v*)  =  0.  Reductions  in  the  husband's  utility  to 
the  left  of  this  point  must  also  decrease  utility  for  the  wife,  for  a  given  level  of  resources. 

The  first-order  conditions  can  be  rearranged  to  deliver 

PK2(v,v)=qK2(w(9),w(9))  (16) 

Kx(w{e),w(0))' 


K1(v,v)=0E 
K2(v,v)  "  p 


K2(w(e),w(6)) 


(17) 


Condition  (16)  can  then  be  used  to  argue  that  a  steady-state  requires  q  =  $.  Indeed,  if  q  <  J3,  then 
{K2t}  would  increase  without  bound;  likewise,  if  q  >  $,  then  {K2t}  decreases  toward  zero.  Both 
situations  clearly  do  not  lend  themselves  to  the  existence  of  an  invariant  distribution  for  (v,v). 
On  the  other  hand,  if  q  =  (3  then  K2(vt,Vt)  is  constant  along  the  optimal  path  and  an  invariant 
distribution  is  possible. 

When  q  =  (3,  the  state  (vt,vt)  moves  along  a  one-dimensional  locus  given  by  K2(v,v)  = 
K2(vq,vq).  Intuitively,  since  no  incentives  are  required  for  the  wife,  she  is  perfectly  insured  in 
the  sense  that  the  marginal  cost  of  delivering  welfare  to  her  is  held  constant  across  time. 

Figure  5  shows  that  the  curve  K2(v,v)  =  K2(v0,i>o)  for  continuation  utilities  cuts  the  isocost 
curves  from  below,  and  cuts  Ki(v,v)  =  0  from  above.  Intuitively,  incentives  require  foregoing 
perfect  insurance  for  the  husband  and  accepting  fluctuations  in  v  as  rewards  and  punishments. 
Starting  from  (v* ,  v*),  rewards  can  be  delivered  in  two  ways.  The  optimum  makes  use  of  both  forms 
of  rewards,  explaining  the  shape  of  the  schedule  for  continuation  utilities. 

The  first  form  of  rewarding  involves  increasing  resources  K,  and  can  be  seen  as  an  upward 
movement  along  the  diagonal  Ki(v,v)  =  0.  However,  the  husband  is  also  rewarded  by  allowing  an 
allocation  of  these  resources  that  is  more  to  his  liking,  which  can  be  represented  as  lateral  movements 
along  the  Pareto  frontier,  which  at  (v*,  v*)  is  horizontally  flat.  The  solution  combines  both  forms  of 
rewards  and,  as  a  result,  (v,v)  travels  along  K2(v,v)  =  K2(v0,v0)  to  the  right  of  K\{y,v)  =  0  and 
above  the  initial  isocost  curve.  Note  that  punishments  will  push  the  agent  on  the  upward  sloping 
section  of  the  Pareto  frontier.  Thus,  ex-ante  efficiency  demands  ex-post  inefficiency.13 


13Returning  to  the  analogy  in  footnote  12:  parents  often  complain  that  the  punishments  they  choose  to  inflict,  on 
their  children  hurt  them  more  than  they  do  their  kids. 
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Figure  5:  Isocost  curves  of  K(v,  v) 

Condition  (17)  is  the  analog  of  the  conditional-expectation  equation  (12)  obtained  from  the 
one-dimensional  Bellman  equation.  Here,  it  implies  that  the  slope  of  the  isocost  curve  (Pareto 
frontier),  —Ki/K2  reverts  geometrically  toward  0.  Thus,  (vt,vt)  moves  along  K2(v,v)  =  K2(vq,vo) 
and  eventually  reverts  toward  (v* ,  v*).  Intuitively,  the  solution  deviates  from  (v*,  v*)  to  provide  the 
husband  with  incentives,  but  it  is  efficient  to  revert  back  to  this  point  of  maximum  efficiency  for 
the  wife:  Patience  ensures  that  the  wife  has  her  way  in  the  long-run. 

6     Estate  Taxation 

We  now  turn  to  a  repeated  Mirrleesian  economy  and  study  optimal  taxation.  In  this  version  of  our 
model,  individuals  have  identical  preferences  over  consumption  and  work  effort  but  are  heterogenous 
regarding  their  labor  productivity,  which  is  privately  observed  by  the  individual  and  independently 
distributed  across  generations  and  dynasties.  We  continue  to  focus  on  the  case  where  the  social 
welfare  criterion  discounts  the  future  at  a  lower  rate  than  individuals. 

Unlike  the  taste-shock  model,  here  even  if  we  were  to  restrict  allocations  to  feature  no  link 
between  parent  and  child,  there  would  still  be  a  non-trivial  planning  problem.  Indeed,  in  each 
period  the  situation  would  then  be  identical  to  the  static,  nonlinear  income  tax  problem  originally 
studied  by  Mirrlees  (1971).  Moreover,  in  the  absence  of  altruism,  so  that  (3  =  0,  the  social  optimum 
actually  coincides  with  this  static  solution.  With  altruism,  however,  we  shall  see  below  that  it  is 
always  optimal  to  link  welfare  across  generations  within  a  dynasty  to  enhance  incentives  for  parents. 

Despite  differences  between  the  Mirrleesian  economy  and  our  taste-shock  model,  our  previous 
analysis  can  be  adapted  virtually  without  change.    In  particular,  a  recursive  representation  can 
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be  derived,  and  the  Bellman  equation  can  be  used  to  characterize  the  solution  and  to  establish 
that  a  steady-state,  invariant  distribution  exists.  This  highlights  the  fact  that  our  model  requires 
asymmetric  information,  but  not  any  particular  form  of  it. 

We  focus  on  an  implementation  of  the  allocation  that  uses  income  and  estate  taxes,  and  derive 
some  interesting  results  for  the  latter.  We  find  that  estate  taxation  should  be  progressive:  more 
fortunate  parents  should  face  a  higher  average  marginal  tax  rate  on  their  bequests.  This  result 
reflects  the  mean  reversion  in  consumption  explained  in  the  previous  section.  A  higher  estate  tax 
ensures  that  the  fortunate  face  a  lower  net  rate  of  return  across  generations,  and  that  consequently 
their  consumption  path  decreases  over  time  toward  the  mean. 

Repeated  Mirrlees:  Productivity  Shocks 

Each  period  of  this  economy  is  identical  to  the  canonical  optimal  taxation  setup  in  Mirrlees  (1971). 
Utility  depends  on  the  level  of  consumption  c  and  work  effort  n.  We  assume  that  individuals  in 
generation  t  have  identical  preferences  that  satisfy 

Vt=Et_1[u(ct)-h(nt)+pVt+1], 

but  differ  regarding  their  productivity  in  translating  work  effort  into  output.  An  individual  with 
productivity  w,  exerting  work  effort  n,  produces  output  y  =  wn.  We  assume  that  productivity  w 
is  independently  and  identically  distributed  across  dynasties  and  generations.  Thus,  the  produc- 
tivity talents  of  parent  and  child  are  unrelated — innate  skills  are  assumed  nonheritable.  Given  this 
assumption,  if  the  optimum  features  intergenerational  transmission  of  welfare,  then  it  represents  a 
social  decision  to  provide  altruistic  parents  with  incentives  in  this  way,  and  not  a  mechanical  result 
originating  from  the  assumed  physical  environment.14 

For  convenience,  we  adopt  the  power  disutility  function  h(n)  =  n1  /j  so  that,  defining  9  =  iy~7, 
we  can  write  total  utility  over  consumption  and  output  as  being  subject  to  taste  shocks 

oo  oo 

Y^pE^u^-eMyt)}  =^^E_1[A(ci-i)-^%i)]  -E_x[0o%o)] 

t=0  t=l 

The  right-hand  side  of  this  equation  leads  to  a  convenient  recursive  representation  of  the  planning 
problem  in  the  continuation  utility  defined  by  vt  =  YlT=o0S  E(-i[«(ct+s-i)  —  6t+sh(yt+s)]  (where  we 
are  abusing  notation  slightly  by  folding  the  /?_1  into  the  definition  of  the  utility  function  u(c)). 

The  resource  constraint  requires  total  consumption  not  to  exceed  total  output  plus  some  fixed 
constant  endowment 

f  Y^ cVt(et) Pr(^) (tyiv)  <  f  J2 yvt(et) Pr(#£) d^(v)  +  e      i  =  0, 1, . . . 


14  As  in  the  taste  shock  model,  here  the  case  with  0  =  j3  leads  to  immiseration.    This  case  has  been  studied  by 
Albanesi  and  Sleet.  (2004),  who  impose  an  exogenous  lower  bound  on  dynastic  welfare  to  circumvent  immiseration. 
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where  individuals  are  indexed  by  their  initial  utility  entitlement  v,  with  distribution  ip  in  the 
population. 

We  continue  to  assume  social  discounting  is  lower  than  private  discounting:  0  >  (3.  The  planning 
problem  is  to  choose  an  allocation  {c^(9t),y^(9t)}  to  maximize  average  social  welfare  subject  to  the 
incentive-compatibility  constraints  and  the  resource  constraints. 

Using  the  last  expression  for  the  utility  function  and  applying  similar  reasoning  as  in  the  taste- 
shock  model  yields  the  Bellman  equation  for  the  associated  relaxed  problem 

k(v)  =  max  E[w_  -  Xc(u_)  -  8h{9)  +  Xy(h{8))  +  J3k(w(8))] 

U-,h,w 

v  =  E[u_  -  8h(6)  +  (3w(6)} 

-8h{d)  +  (3w{6)  >  -9h{9')  +  Pw(8'), 

where  the  function  y(h)  represents  the  inverse  of  the  disutility  function,  y  =  h~l .  The  arguments 
that  justify  the  study  of  this  Bellman  equation,  are  similar  to  those  that  underlie  Theorems  1  and  2 
in  the  context  of  the  taste-shock  model.  The  results  regarding  steady  states  parallel  those  obtained 
previously,  and  imply  that  an  invariant  distribution  exists  with  no  immiseration,  as  in  Proposition  2. 

Implementation  with  Income  and  Estate  Taxation 

Any  allocation  that  is  incentive  compatible  and  feasible  can  be  implemented  by  a  combination  of 
taxes  on  labor  income  and  estates.  Here  we  first  describe  this  implementation,  and  explore  some 
features  of  the  optimal  estate  tax  in  the  next  subsection. 

For  any  incentive-compatible  and  feasible  allocation  {cvt{9i),  y^{9t)}  we  propose  an  implemen- 
tation along  the  lines  of  Kocherlakota  (2004).  In  each  period,  conditional  on  the  history  of  their 
dynasty's  reports  6  and  any  inherited  wealth,  individuals  report  their  current  shock  9t,  produce, 
consume,  pay  taxes  and  bequeath  wealth  subject  to  the  following  set  of  budget  constraints 

ct  +  bt  <  yt(9l)  -  Tt(0<)  +  (1  -  n^R^A-i        t  =  0, 1, . . .  (18) 

where  Rt-i,t  is  the  before-tax  interest  rate  across  generations,  and  initially  6_i  =  0.  Individuals  are 
subject  to  two  forms  of  taxation:  a  labor  income  tax  Tt(9  ),  and  a  proportional  tax  on  inherited 
wealth  i?t_1}46t_i  at  rate  7^(0*). ly 

Given  a  tax  policy  {Tl'(9t))  r^fl*),  yl{9t)})  an  equilibrium  consists  of  a  sequence  of  interest  rates 
{Rt,t+i}]  an  allocation  for  consumption,  labor  income  and  bequests  {cvt{9t))  ^(O1)};  and  a  reporting 
strategy  {cr^(9t)}  such  that:    (i)  {ct,  bt,  crt}  maximize  dynastic  utility  subject  to  (18),  taking  the 


15In  this  formulation,  taxes  are  a  function  of  the  entire  history  of  reports,  and  labor  income  yt  is  mandated  given 
this  history.  However,  if  the  labor  income  histories  y( :  0*  — »  R*  being  implemented  are  invertible,  then  by  the 
taxation  principle  we  can  rewrite  T  and  r  as  functions  of  this  history  of  labor  income  and  avoid  having  to  mandate 
labor  income.  Under  this  arrangement,  individuals  do  not  make  reports  on  their  shocks,  but  instead  simply  choose 
a  budget-feasible  allocation  of  consumption  and  labor  income,  taking  as  given  prices  and  the  tax  system. 
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sequence  of  interest  rates  {Rt^t+i}  and  the  tax  policy  {Tt,  rt,  yt}  as  given;  and  (ii)  the  asset  market 
clears  so  that  fE_i[b^(9t)}  d(p(v)  =  0  for  all  i  =  0, 1, . . .  We  say  that  a  competitive  equilibrium  is 
incentive  compatible  if,  in  addition,  it  induces  truth  telling. 

For  any  feasible,  incentive-compatible  allocation  {c",  y^},  we  construct  an  incentive-compatible 
competitive  equilibrium  with  no  bequests  by  setting  T^{91)  =  yt(#4)  —  ct(9t)  and 

PHt-i,t     u'(cvt{8  )) 

for  any  sequence  of  interest  rates  {i?t_iit}.  These  choices  work  because  the  estate  tax  ensures  that 
for  any  reporting  strategy  cr,  the  resulting  consumption  allocation  {c"(crf (#'))}  with  no  bequests 
b^(9t)  =  0  satisfies  the  consumption  Euler  equation 

U'(c?(aW)-)=/?iWEu'(^ 

Ot+i 

The  labor  income  tax  is  such  that  the  budget  constraints  are  satisfied  with  this  consumption  allo- 
cation and  no  bequests.  Thus,  this  no-bequest  choice  is  optimal  for  the  individual  regardless  of  the 
reporting  strategy  followed.  Since  the  resulting  allocation  is  incentive  compatible,  by  hypothesis,  it 
follows  that  truth  telling  is  optimal.  The  resource  constraints  together  with  the  budget  constraints 
then  ensure  that  the  asset  market  clears.16 

As  noted  above,  in  our  economy  without  capital  only  the  after-tax  interest  rate  matters  so  the 
implementation  allows  any  equilibrium  before-tax  interest  rate  {i2t_1)t}.  In  the  next  subsection, 
we  set  the  interest  rate  to  the  reciprocal  of  the  social  discount  factor,  Rt-i,t  —  $~l-  This  choice  is 
natural  because  it  represents  the  interest  rate  that  would  prevail  at  the  steady  state  in  a  version  of 
our  economy  with  capital. 

Optimal  Progressive  Estate  Taxation 

In  this  subsection  we  derive  an  important  intertemporal  condition  that  must  be  satisfied  by  the 
optimal  allocation.  This  condition  has  interesting  implications  for  the  optimal  estate  tax,  computed 
using  (19)  at  the  optimal  allocation. 

Let  A  be  the  multiplier  on  the  promise-keeping  constraint  and  let  n(6,  9')  represent  the  multipliers 
on  the  incentive  constraints.  Then  the  first-order  conditions  for  u_  and  w(9)  are 

c'(u_)-A-A    =    0 
0V(w(9))p(9)-p\p(e)-J2M6')  +  'Elti0',O)    =    0 


16  Since  the  consumption  Euler  equation  holds  with  equality,  the  same  estate  tax  can  be  used  to  implement  alloca- 
tions with  any  other  bequest  plan  with  income  taxes  that  are  consistent  with  the  budget  constraints. 
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and  the  envelope  condition  is  k'  (v)  —  A.  Together  these  imply 

YJk'{w{e))P{0)  =  ik'(v), 
0  p 

Using  c'(w_)  =  l/u'(c_)  =  A  +  k'(v)  we  arrive  at  the  Modified  Inverse  Euler  equation 


u'(c_)         JY"W)  V^ 

The  left-hand  side  together  with  the  first  term  on  the  right-hand  side  is  the  standard  inverse  Euler 
equation.  The  second  term  on  the  right-hand  side  is  novel,  since  it  is  zero  when  /?  =  (3  and  is  strictly 
negative  when  (3  >  (3.Xl 

In  our  environment,  the  relevant  past  history  is  encoded  in  the  continuation  utility  so  the  estate 
tax  r(^f_1,6,t)  can  actually  be  reexpressed  as  a  function  of  t>t(0t_1)  and  8t.  Abusing  notation  we 
then  denote  the  estate  tax  by  Tt(v,9t).  Since  we  focus  on  the  steady-state,  invariant  distribution, 
we  also  drop  the  time  subscripts  and  write  t(v,9). 

The  average  estate  tax  rate  f(v)  is  then  defined  by 

1  -  f(v)  =  £  (1  -  t(v,6))P(6)  (21) 

e 

Using  the  modified  inverse  Euler  equation  (20)  we  obtain 

f(v)   =   -AV(c>))(J-l) 

In  particular,  this  implies  that  the  average  estate  tax  rate  is  negative,  f(v)  <  0,  so  that  bequests 
are  subsidized.  However,  recall  that  before-tax  interest  rates  are  not  uniquely  determined  in  our 
implementation.  As  a  consequence,  neither  are  the  estate  taxes  computed  by  (19).  With  our 
particular  choice  for  the  before-tax  interest  rate,  however,  the  tax  rates  are  pinned  down  and 
acquires  a  corrective,  Pigouvian  role.  Differences  in  discounting  can  be  interpreted  as  a  form  of 
externalities  from  future  consumption,  and  the  negative  average  tax  can  then  be  seen  as  a  way 
of  countering  these  externalities  as  prescribed  by  Pigou.  In  our  setup  without  capital,  this  result 
depends  on  the  choice  of  the  before-tax  interest  rate.  However,  the  negative  tax  on  estates  would 
be  a  robust  steady-state  outcome  in  a  version  of  our  economy  with  capital. 

In  our  model  it  is  more  interesting  to  understand  how  the  average  tax  varies  with  the  history 
of  past  shocks  encoded  in  the  promised  continuation  utility  v.  The  average  tax  is  an  increasing 
function  of  consumption,  which,  in  turn,  is  an  increasing  function  of  v.  Thus,  estate  taxation  is 
progressive:  the  average  tax  on  transfers  for  more  fortunate  parents  is  higher. 


^This  equation  can  also  be  derived  from  an  elementary  variation  argument.  This  is  done  in  Farhi,  Kocherlakota 
and  Werning  (2005),  who  also  show  that  this  equation,  and  its  implications  for  estate  taxation,  generalize  to  an 
economy  with  capital  and  an  arbitrary  process  for  skills. 
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Proposition  4  In  the  repeated  Mirrlees  economy,  the  optimal  allocation -can  be  implemented  by  a 
combination  of  income  and  estate  taxes.  At  a  steady-state,  invariant  distribution  ip* ,  the  optimal 
average  estate  tax  f(v)  defined  by  (19)  and  (21)  is  increasing  in  promised  continuation  utility  v. 

The  progressivity  of  the  estate  tax  reflects  the  mean-reversion  in  consumption.  The  fortunate 
must  face  lower  net  rates  of  return  so  that  their  consumption  path  decreases  towards  the  mean.18 

7     Conclusions 

Should  privately-felt  parental  altruism  affect  the  social  contract?  If  so,  what  are  the  long-run 
implications  for  inequality?  To  address  these  questions,  we  modeled  a  central  tension  in  society: 
the  tradeoff  between  ensuring  equality  of  opportunity  for  newborns  and  providing  incentives  for 
altruistic  parents. 

Our  model's  answer  is  that  society  should  indeed  exploit  altruism  to  motivate  parents,  linking 
the  welfare  of  children  to  that  of  their  parents.  However,  we  also  find  that  if  we  value  the  welfare 
of  future  generations  directly,  the  inheritability  of  good  or  bad  fortune  should  be  tempered.  This 
produces  a  steady-state  outcome  in  which  welfare  and  consumption  are  mean-reverting,  long-run 
inequality  is  bounded,  social  mobility  is  possible  and  misery  is  avoided  by  everyone. 

What  instruments  should  society  use  to  implement  such  allocations?  For  a  Mirrleesian  version 
of  our  model  we  find  an  important  role  for  the  estate  tax.  The  optimal  tax  on  inheritances  is 
progressive:  more  fortunate  parents  should  face  a  higher  average  marginal  tax  rate  on  their  bequests. 
This  result  illustrates  an  interesting  way  in  which  the  conflict  between  corrective  and  redistributive 
taxation  is  optimally  resolved.  Further  examination  of  other  situations  with  similar  conflicts  remains 
an  interesting  direction  for  future  work.19 

Appendix 

Proof  of  Theorem  1 

Weak  concavity  of  the  value  function  k{v)  follows  because  the  relaxed  sequence  problem  has  a 
concave  objective  and  a  convex  constraint  set.  The  weak  concavity  of  the  value  function  k{v)  implies 
its  continuity  over  the  interior  of  its  domain.  If  utility  is  bounded,  continuity  at  the  extremes  can 
also  be  established  as  follows.  Define  the  first-best  value  function 


k*(v)  =  max^T/^E-x^u^)  -  Ac(ut(0*))  ] 


4=0 


subject  to  v  =  X^o^'^-il^*^*)]-    Then  k*(v)  is  continuous  and  k(v)  <  k*(v),  with  equality 
at  any  finite  extremes  v  and  v.   Then  continuity  of  k(v)  at  finite  extremes  follows.   Thus,  k(v)  is 


18Farhi,  Kocherlakota  and  Werning  (2005)  explore  more  general  versions  of  this  result  and  discuss  other  intuitions. 
19Some  progress  along  these  lines  can  be  found  in  Amador,  Angeletos  and  Werning  (2005). 
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continuous. 

The  constraint  (6)  with  q  =  $  implies  that  utility  and  continuation  utility  are  well-defined. 
Toward  a  contradiction,  suppose 


T 

T— >oo 


lim  ]T/?'£a0tu(ct) 


t=o 

is  not  defined,  for  some  s  >  — 1.  This  implies  that  limx^oo  ^]t=0/?imax{£'s6liu(ct),  0}  =  oo.  Since 
utility  is  concave  9u(c)  <  Ac  +  B  for  some  A,  B  >  0,  so  it  follows  that 

T  T  T 

J^2(3tm&x{Es9tu{ct),0}  <  ^ ]P f? Esq  +  S  <  A^(?Eact  +  B 

t=0  t=0  4=0 

Taking  the  limit  yields  lini7'_00  ^t=0  $*  E_iCf  =  oo.  Since  there  are  finitely  many  histories  0s  £  0S+1 
this  implies  limj^oo  X^t=o^*^-ic<  =  °°-  ^  there  is  a  non-zero  measure  of  such  agents  this  implies 
a  contradiction  of  (6).  Thus,  for  both  the  relaxed  and  unrelaxed  problems  utility  and  continuation 
utility  are  well  defined  given  the  other  constraints  on  the  problem.  This  is  important  for  our 
recursive  formulation  below. 

We  next  prove  two  lemmas  that  imply  the  rest  of  the  theorem.    Consider  the  optimization 
problem  on  the  right  hand  side  of  the  Bellman  equation: 

supE[0u(0)  -  \c(u{9))  +  J3k(w{6))]  (22) 

u,w 

v  =  E[6u(6)  +  0w(6j\  (23) 

6u  (0)  +  pw(6)  >  Qu  (0')  +  pw(6')  for  all  0,  6'  e  0  (24) 

Define  m  =  maxc>Oieee(0w(c)  —  Ac)  and  k  (v)  =  k  (v)  —  m/(l  —  $)  <  0.  The  problem  in  (22)  is 
equivalent  to  the  following  optimization  with  non-positive  objective: 

supE[0u(0)  -Xc(u(9))  -m  +  pk{w{9))}  (25) 

subject  to  (23)  and  (24). 

Lemma  A.l   The  supremum  in  (22),  or  equivalently  (25),  is  attained. 

Proof.  If  utility  is  bounded  the  result  follows  immediately  by  continuity  of  the  objective  function 
and  compactness  of  the  constraint  set.  So  suppose  utility  is  unbounded  above  and  below  —  similar 
arguments  apply  when  utility  is  only  unbounded  below  or  only  unbounded  above.  We  first  show 
that 

lim  k(v)  =    lim    k(v)  =  -co  (26) 

v— *oo  v— >— oo 

and  then  use  this  result  to  restrict,  without  loss,  the  optimization  within  a  compact  set,  ensuring  a 
maximum  is  attained. 
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To  establish  these  limits,  define 

oo 

h(v;  0)  =  sup  J2  Pl  E-i  [QMS*)  -  Ac(u(0*))  -  m 


t=o 


subject  to  v  =  E_i  Ylt^oPt^tu(9t)-  Since  this  corresponds  to  the  same  problem  but  without  the 
incentive  constraints  it  follows  that  k(v)  <  h(v,J3).  If  lim,,-^  h(v,  J3)  =  lim„__00  h(v,  J3)  =  — oo, 
then  the  desired  limits  (26)  follow.  Since  9u  —  Xc(u)  —  ra  <  0  and  ft  <  ft  it  follows  that 

777 

h(v,P)  <  h(v,0)  =  v  -  XC(v,P)  -  — ,  (27) 

where 

oo 

C(u,/?)  =  inf  V^'E_ic(u(0')) 

i=0 

subject  to  v  =  Yl'tLoftt'^-i[6t'u(9t)]-  Note  that  C(v,/3)  is  a  standard  convex  first-best  allocation 
problem,  with  solution  «(#')  =  (c')~1(9t,y(v))  for  some  positive  multiplier  j(v),  increasing  in  v  and 
such  that  lim„__00  ^(v)  =  0  and  limi;_00  7  (v)  =  00.  Then 


C(v,P)  =  T±-E[cfe)-1(61(v)))~ 


1-/3 

so  that  lim^^.oo  h(v,  /3)  =  —00  and  lim^oo  h(v,  (3)  =  —00.  Using  the  inequality  (27)  this  establishes 
linv^-oo  h(v,$)  =  —00  and  hm1,_00  h(v,$)  =  —00,  which,  in  turn,  imply  the  limits  (26). 

Fix  a  v.  Take  any  allocation  that  verifies  the  constraints  (23)  and  (24)  and  let  k  <  00  be 
the  corresponding  value  of  (25).  Then,  since  the  objective  is  non-positive,  we  can  restrict  the 
maximization  to  w{6)  such  that  k(w(9))  >  k/(j3p(9)).  Since  k{w{9))  is  concave  with  the  limits  (26), 
this  defines  a  closed,  bounded  interval  for  w(9),  for  each  9.  It  follows  that  there  exists  Mv>w  <  00 
such  that  we  can  restrict  the  maximization  to  |if(0)|  <  MVyW. 

Similarly,  we  can  restrict  the  maximization  over  u{9)  so  that  9u(9)  —  Xc(u(9))  —  m  >  k/p(9). 
Since  (9u  —  Xc(u))  is  strictly  concave,  with  (9u  —  Xc(u))  — >  —00  when  either  u  — >  00  or  u  — >  —00, 
this  defines  a  closed,  bounded  interval  for  u(9),  for  each  9.  Thus,  there  exists  an  MViU  <  00  such 
that  we  can  restrict  the  maximization  to  \u(9)\  <  Mv>u. 

Hence,  we  can  restrict  the  maximization  in  (25)  to  a  compact  set.  Since  the  objective  function 
is  continuous  over  this  restricted  set,  the  maximum  must  be  attained.    ■ 

Lemma  A. 2   The  value  function  k(v)  satisfies  the  Bellman  equation  (7)-(9). 
Proof.  Suppose  that  for  some  v 

k(v)  >  maxE[0u(0)  -  Xc(u{9))  +  J3k(w(9))] 
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where  the  maximization  is  subject  to  (23)  and  (24).  Then  there  exists  A  >  0  such  that 

k(v)  >  E[6u{9)  -  Xc(u{9))  +  J3k(w(6))]  +  A 
for  all  (u,w)  that  satisfy  (23)  and  (24).  But  then  by  definition 

k{w(6))  >  Y^P^-ilOMO*)  -  Ac(«t(^))] 
for  all  allocations  u  that  yield  w  (9)  and  are  incentive  compatible.  Substituting,  we  find  that 

oo 

k(v)  >  J^p'E^leMe^-xc^tie1))]  +a 

for  all  incentive-compatible  allocations  that  deliver  v,  a  contradiction  with  the  definition  of  k(v). 
Namely,  that  there  should  be  a  plan  with  value  arbitrarily  close  to  k(vo).  We  conclude  that  k(v)  < 
maxu,w  E[6u(9)  -  Xc(u(9))  +  J3k(w(9))]  subject  to  (23  )  and  (24). 

By  definition,  for  every  v  and  e  >  0  there  exists  a  plan  {u^Q1;  v,  e)}  that  is  incentive  compatible 
and  delivers  v  with  value 

oo 

]T  ft  E_j  [OMe1;  v,e)-X  c(u4(^;  v,e))]  >  k(v)  -  e. 
t=o 

Let  (u*{6),w*(9))  E  SirgmaxUiWE[9u(9)  -  Xc(u{9))  +  J3k{w(9))].  Consider  the  plan  u0(9Q)  =  u*(90) 
andut(0')  =  ut-i((0u  . . .  ,9t);w*(60),e)  for  t  >  1.  Then 


oo 

=    E_xk^(0o)-Ac(u*(0o))+^^^Eo[0m^+1(0i+1)-Ac(^+1(0t+1))] 


t=0 


>    maxE[0u(fl)-Au(0)+/3/c(w(#))l  - /k 

Since  e  >  0  was  arbitrary  it  follows  that  fc(u)  >  ma,xu<wE[9u(9)  —  Xc(u(9))  +  J3k(w(9))}  subject  to 
(23)  and  (24). 

Finally,  together  both  inequalities  imply  k(v)  =  maxU}WE[9u(9)  —  Xc(u(6))  +  j3k(w(9))}  subject 
to  (23)  and  (24).    ■ 

Proof  of  Theorem  2 

Part  (a).    Suppose  the  allocation  {ut}  is  generated  by  the  policy  functions  starting  from  v0,  is 
incentive  compatible  and  delivers  lifetime  utility  v0.   After  repeated  substitutions  of  the  Bellman 
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equation  ( / ) ,  we  arrive  at 

T 


k(vQ)  =  Y,Pt^-i{8Met)-\c{ut{dt))}  +  (3   E-1k(vT{eT)).  (28) 

t=o 

Since  k(v0)  is  bounded  above  this  implies  that 

oo 

k(v0)  <  YtP'E-iWMe1)  -  HMO1))}, 


4=0 


so  {ut}  is  optimal,  by  definition  of  k(vo). 

Conversely,  suppose  an  allocation  {ut}  is  optimal  given  v0.    Then,  by  definition  it  must  be 
incentive  compatible  and  deliver  utility  vq.  Define  the  continuation  utility  implicit  in  the  allocation 


w0 


(9Q)  =  J^pt-1Eo[9tut(0t)}- 


t=i 


and  suppose  that  either  u0  (9)  ^  gu  (9;  v0)  or  w0  (9)  ^  gw  (9;  v0),  for  some  9  £  0.  Since  the  original 
plan  {ut}  is  incentive  compatible,  u0  (9)  and  w0  (9)  satisfy  (23)  and  (24).  The  Bellman  equation 
then  implies  that 

k(v0)     =    E[gu(9;v0)-Xc(gu(9;v0))+Pk(gw(9;v0))} 

>  E[u0(9)  -  Xc(u0(9))  +  Pk(wQ(9))] 

oo 

>  E_j  [uo(60)  -  \c(u0(90))]  +  X>*  E-i  [MO')  ~  HMO'))}  ■ 

t=i 

The  first  inequality  follows  since  u0  does  not  maximize  (7),  while  the  second  inequality  follows  the 
definition  of  k(wo(6)).  Thus,  the  allocation  {ut}  cannot  be  optimal,  a  contradiction.  A  similar 
argument  applies  if  the  plan  is  not  generated  by  the  policy  functions  after  some  history  9l  and 
t  >  1.  We  conclude  that  an  optimal  allocation  must  be  generated  from  the  policy  functions. 

Part  (b).  First,  suppose  an  allocation  {ut,  vt]  generated  by  the  policy  functions  (gu,  gw)  starting 
at  Vq  satisfies  limt_oo/?tE_iut(0t)  =  0.  Then,  after  repeated  substitutions  of  (8),  we  obtain 


T 
r 

t=0 


=  Y,Pt^-AetMOt)\  +  PTE^[vT(9T)}.  (29) 


Taking  the  limit  we  get  v0  =  Yl'tLoPt'^,-i[OtM0t)]  so  that  the  allocation  {ut}  delivers  lifetime  utility 
v0.  Next,  we  show  that  for  any  allocation  generated  by  (gu,gw),  starting  from  finite  v0,  we  have 
Hmt-,ooPt^~iVt{9t)  =  0. 

Suppose  utility  is  unbounded  above  and  limsup^^  /?'  E_it>t(6|t)  >  0.  Then  0  >  P  implies  that 
lim supj^oo /5(  E_ivt(9t)  =  oo.   Since  the  value  function  k(v)  is  non-constant,  concave  and  reaches 
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an  interior  maximum,  we  can  bound  the  value  function  so  that  k(v)  <  av  +  b,  with  a  <  0.  Thus, 
limsup/J'E-ifc^fl*))  <alimsup/5tE_1ui(^)  +  6  =  -oo 

t— >oo  t— too 

and  then  (28)  implies  that  k(v0)  =  -co,  a  contradiction  since  there  are  feasible  plans  that  yield 
finite  values.  We  conclude  that  lim sup^,^ /3'  E_iut(0')  <  0. 

Similarly,  suppose  utility  is  unbounded  below  and  that  liminfi_00/3tE_it't(^<)   <   0.     Then 
liminft_0O/3*E_1'Uf(<9<)  =  -co.  Using  k(v)  <  av  +  b,  with  a  >  0,  we  conclude  that 

liminf/^E.i/cfWfl'))  =  -oo 

t— >oo 

implying  k(v0)  =  —  oo,  a  contradiction.  Thus,  we  must  have  liminfi_too/?tE_iU£(^t)  >  0. 

The  two  established  inequalities  imply  lim^oo/?4  E_1ut(6|t)  =  0. 

Part  (c).  Suppose  limsup4_^0O/3tE_ift(a'(^t))  >  0  for  every  reporting  strategy  a.  Then  after 
repeated  substitutions  of  (9), 

T 

v>Y,P'  E-i  [°^t  (*W)]  +  f  E-iVr  {°T{0T))  ■ 
t=o 

implying 

T 

v  >  lim  inf  ]T  0-  E_i  [0tu,  (^(fl*))]  • 
Therefore,  {ut}  is  incentive  compatible,  since  v  is  attainable  with  truth  telling  from  part  (b). 

Proof  of  Lemma  1 

Part  (a)  (Strict  Concavity)  Let  {ut(9t,Vo),vt(9t,v0)}  be  the  plans  generated  from  the  policy 
functions  starting  at  vq  (note:  no  claim  of  incentive  compatibility  is  required).  Take  two  initial 
utility  values  va  and  Vb,  with  va  ^  v^.  Define  the  average  utilities 

<(0')    =    aut{9\va)  +  (\  -  a^O1  ,vb) 
v?(9l)    =    avti^Va) +  (1-0^(9*;^) 

Theorem  2  part  (b)  implies  that  {«j(#\fa)}  and  {«((#*,  i>b)}  deliver  va  and  vb,  respectively.  This 
immediately  implies  that  {u°(9t)}  delivers  initial  utility  va  =  ava  +  (1  —  a)vb.  It  also  implies  that 
there  exists  a  finite  time  T  such  that 


]T  0*  E_i  [6tut (9l;  va)}y^J2  p*  E_!  [OMB*;  vb)} . 

i=0  t=o 
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so  that 

ut(9t;va)^ut(6t-vb),  (30) 

for  some  history  6>*  e  0i+1.  Consider  iterating  T  times  on  the  Bellman  equations  starting  from  va 
and  Vf,: 

T 

k{va)     =    £]/?*£_!  [0^(0*;^)  -c(ut(0';Ua))]  +^TE_1fe(uT(0T;Ua)) 

4=0 

T 

fc(v6)    =    £)i&'E_1ptwt(e*;«i,)-c(Tit(fit;^))]+)&rE_1A:(wr^:r;«6)), 

and  averaging  we  obtain 


t=o 


ak(va)  +  (l-a)k(vb)    =    ^^E.!^^4)-  [ac(^(0i;t;Q))  +  (l-a)c(Ut(^;^))]] 

4=0 

+£T  E_x  [ak(vT(9T;  va))  +  (1  -  a)^r(#r;  «>))] 

T 

J^'E^u?^)  -Qc^f^*))]  +  0TE_lk(v$(9T)) 


T 
< 

t=0 


where  the  strict  inequality  follows  from  the  strict  concavity  of  the  cost  function  c  (it) ,  the  fact 
that  we  have  the  inequality  (3(1),  and  the  weak  concavity  of  the  value  function  k.  The  last  weak 
inequality  follows  from  iterating  on  the  Bellman  equation  for  va  since  the  average  plan  (ua,va) 
satisfies  the  Bellman  equations  constraints  at  every  step.  This  proves  that  the  value  function  k(v) 
is  strictly  concave. 

(b)  (Differentiability)  Since  the  value  function  k(v)  is  concave,  it  is  sub-differentiable — that 
is,  there  is  at  least  one  sub-gradient  at  every  point  v.  Differentiability  can  then  be  established  by 
the  following  variational  envelope  arguments. 

Suppose  first  that  utility  is  unbounded  below.  Fix  an  interior  value  v0  for  initial  utility.  For  a 
neighborhood  around  vo  define  the  test  function 

W(v)  =  E  [9(gu(9,  v0)  +  (v-  vQ))  -  Xc(gu(9,  v0)  +  (v  -  v0))  +  0k (g™ (6,  v0))] . 

Since  W(v)  is  the  value  of  a  feasible  allocation  in  the  neighborhood  of  i>0  it  follows  that  W(v)  <  k(v), 
with  equality  at  v0.  Since  W'(vq)  exists  it  follows,  by  application  of  the  Benveniste-Scheinkman 
Theorem  (see  Theorem  4.10,  in  Stokey,  Lucas  and  Prescott,  1989),  that  k'(v0)  also  exists  and 

k'(v0)  =  W'(vQ)  =  1  -  A  E[c'(u*(9))}.  (31) 

Finally,  since  c'(u)  >  0  this  shows  that  k'(v)  <  1.  The  limit  lim^-oo  k'(v)  =  1  is  inherited 
by  the  upper  bound  k(v)   <  h(v,0)  +  m/(l  —  J3)  introduced  in  the  proof  of  Theorem  1,  since 
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Urn,,—.*,  Jj;/l(u,/?)  =  1. 

The  limit  lim„_^c  k'(v)  =  —  oo  follows  immediately  from  lim^^  k(v)  =  —  oo,  if  u  <  oo.  Otherwise 
it  is  inherited  by  the  upper  bound  k(v)  <  h(v,P)  +  m/(l  —  $)  introduced  in  the  proof  of  Theorem  1, 
since. lim^oo  -^h(v,P)  =  -oo. 

Next,  suppose  utility  is  bounded  below,  and  without  loss  in  generality  suppose  that  the  utility 
of  zero  consumption  is  zero.  Then  the  argument  above  establishes  differentiability  at  a  point  v0  as 
long  as  gu(9,v0)  >  0,  for  all  9  G  0.  However,  corner  solutions  with  gu(9,vo)  =  0  are  possible  here 
even  with  Inada  assumption  on  the  utility  function,  so  a  different  envelope  argument  is  required. 
We  provide  one  that  exploits  the  homogeneity  of  the  constraint  set. 

If  utility  is  bounded  below,  then  limsup^^  E-iPtvt(a(9t))  >  0  for  all  reporting  strategies  a  so 
that,  applying  Theorem  2,  a  solution  {ut}  to  the  planner's  sequence  problem  is  ensured.  Then,  for 
any  interior  t;0,  the  plan  {{v/va)ut}  is  incentive  compatible  and  attains  value  v  for  the  agent.  In 
addition  the  test  function 


W(v)  =  J2~PtE- 


f=0 


9t-ut{9t)-\c(-ut{9t) 
v0  \v0 


satisfies  W(v)  <  k(v),  W(vq)  =  fc(i>o)  and  is  differentiate.  It  follows  from  the  Benveniste- 
Scheinkman  Theorem,  that  k'(vo)  exists  and  equals  W'(v0). 

The  proof  of  lim^s  k'(v)  =  — oo  is  the  same  as  in  the  case  with  utility  unbounded  below.  Finally, 
we  show  that  lim^^^  k'(v)  =  oo.  Consider  the  deterministic  planning  problem 

oo 

k(v)  =  meix2_]p{ut  ~  Ac(ut)) 
t=o 

subject  to  v  =  Yl'tLo  Ptut-  Note  that  k(v)  is  differentiable  with  \imv->utf(v)  =  oo.  Since  deterministic 
plans  are  trivially  incentive  compatible,  it  follows  that  k(v)  <  k(v),  with  equality  at  v.  Then  we 
must  have  limv-,uk'(v)  =  oo  to  avoid  a  contradiction. 

Proof  of  Proposition  1 

The  CLAR  equation  was  shown  in  the  main  text,  so  we  focus  here  on  the  bounds.  Consider  the 
program 

maxVpn{0nun  -c(un)  +(3k(wn)} 

ii.  in      <       ■ 


n 


V  =  Y^Pn(9nUn  +  P^n) 
n 

9nun  +  Pwn  >  9nun+1  +  Pwn+X  for  n  =  1, 2, . . . ,  K  -  1, 

This  problem  and  its  notation  require  some  discussion.  We  do  not  incorporate  the  monotonicity 
constraint  on  u.  But  this  notation  allows  us  to  consider  bunching  in  the  following  way.  If  any  set 
of  neighboring  agents  is  bunched,  then  we  group  these  agents  under  a  single  index  and  let  pn  be 
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the  total  probability  of  this  group.  Likewise  let  9n  represent  the  conditional  average  of  0  within 
this  group,  which  is  what  is  relevant  for  the  promise-keeping  constraint  and  the  objective  function. 
Let  9n  be  the  taste  shock  of  the  highest  agent  in  the  group.  The  incentive  constraint  must  rule  the 
highest  agent  in  each  group  from  deviating  and  taking  the  allocation  of  the  group  above  him. 

Of  course,  every  combination  of  bunched  agents  leads  to  a  different  program.  We  study  all  of 
them.  The  optimal  allocation  of  our  problem  must  solve  one  of  these  programs,  although  not  nec- 
essarily the  one  that  yields  the  highest  value,  since  this  one  may  not  be  feasible  if  the  monotonicity 
condition  is  violated. 

The  first-order  conditions  are 

Pn{6n  -  Xc'{un)  -  A0n}  +  Qn\in  -  0n_1/un_1  <  0 

Pn0k'(wn)  -  PX}  +  /?(//„  -  Mn-:)  =  0 

where,  by  the  Envelope  theorem,  A  =  k'(v). 

Consider  first  case  with  utility  unbounded  below,  so  that  the  first  order  condition  for  consump- 
tion holds  with  equality.  Summing  the  first-order  conditions  for  consumption,  we  get 

\E[c'(u(6))}  =l-k'{v) 


The  first-order  conditions  for  n  =  1  imply 


This  implies 


Using 


we  get 


OiK         Xd(Ul)       XE[c'(ue)]       1-A 

1       '     0i  pi         h  h  e1 


^<^-(i-a£, 


p     PPI 


fc'M  > 


p 
p 


-^"-4 


8 

1 

01 1 

,//      N           P 

"0i 

1  1 

l  +  7T~ 

k'{v)  +  £ 

—  — 

P 

01 

0i  J 

P 

A 

0i  J 

Similarly,  writing  the  first-order  conditions  for  n  =  K ,  we  get 


(1      x)      Ok-i  /%-i  =  ^  (uk)  >  XE[d(ug)]       1-A 


This  implies 


6K     Pk  0 

V-K-l   >  1  ~X 
Pk     ~  Ok-i 


K 


0 


-(1-A); 


K 


TK 


7K-1 


0 


K 
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Using 


we  get 


k  (wK)  =  —  A  +  -  — — 
0        P   Pk 


k'  (%)  <   -r 


A_I^  +  (1_A)A 


@K-\  &k-i 

For  any  n,  wk  <  wn  <  Wi,  we  have  for  every  n 


7K 


'K-l 


®k-\ 


k'(v) 


I 

P 


~6k 


1 


6k-i      Ok-i 


P 


1       Qx 

1  +  e;-o; 


k'(y)  +  I 

PY 


ft     1 

0i    ex 


<    k'(wn) 


< 


P 
P 


1  + 


1 


e 


K 


®K-\         @K-\ 


k'{v)  + 


1K 


TK-\ 


@K-1 


After  rearranging,  we  obtain 


P 
P 


1  + 


Oi 


P 


(l-fc»)  +  l-|    >    l-k'(gw(9,v)) 


> 


P 
P 


1  + 


1 


e 


K 


Qk-\      & 


K-\ 


(i  -  k'(v)) +i-L 


To  arrive  at  the  expression  in  the  text  we  take  the  worst  case  scenario:  we  choose  the  subproblem 
that  is  most  unfavorable  to  each  bound,  noting  that  1  —  k'{y)  >  0. 

Turning  to  the  bounded  utility  case,  note  that  all  the  first-order  conditions  and  constraints  are 
satisfied  when  A  >  1  with  /in  =  0  and  u(9)  =  u  and  w  (6)  =  P~lv  >  v.  The  first-order  condition  for 
w  implies  k'(w(9))  =  k'(0~  v)  =  (P/P)k'(v).  Since  the  problem  is  strictly  convex,  this  represents 
the  unique  solution.  Recall  that  in  the  arguments  above  establishing  the  lower  bound  involved  no 
assumption  on  interior  solutions  for  u,  so  this  holds  for  all  v.  The  upper  bound,  on  the  other  hand, 
did  require  u{9)  >  u  for  all  9,  which  must  be  true  for  high  enough  v,  i.e.  for  low  enough  k'(v). 


Proof  of  Proposition  2 

Consider  first  the  case  with  utility  unbounded  below.  Since  the  derivative  k'(v)  is  continuous  and 
strictly  decreasing,  we  can  define  the  transition  function 

Q(x,9)  =  k'{gw((k')-1{x),9)) 

for  all  x  <  1  if  utility  is  unbounded  below.    For  any  probability  distribution  fj,,  let  Tq(/i)  be  the 
probability  distribution  defined  by 


tq(v)(A)  =  /  l{Q{Xto)eA)dn(x)dp(6) 
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for  any  Borel  set  A.  Define 

rq  +  7g  +  ---+73 
TQ'n  =  n 

For  example,  TqiTL{5x)  is  the  empirical  average  of  {k'(vt)}?=1  over  all  histories  of  length  n  starting 

with  k'(v0)  =  x.    The  following  lemma  establishes  the  existence  of  an  invariant  distribution  by 

considering  the  limits  of  {TqiTI}. 

Lemma  A. 3  If  utility  is  unbounded  below,  then  for  each  x  <  1  there  exists  a  subsequence  {Tq^u)(5x)} 
that  converges  weakly,  i.e.  in  distribution,  to  an  invariant  distribution  on  (—00, 1)  under  Q. 

Proof.  The  bounds  (13)  derived  in  Proposition  1  imply  that  for  all  9  6  0 

lim<2(z,0)  =    lim   k'(gw{9,v))  =  0/J3  <  1. 

We  first  extend  the  continuous  transition  function  Q(x,  6) :  (—00,  l)x0->  (—00, 1)  to  a  continuous 
transition  function  Q(x,9):  (-00, 1]  x0-t  (— 00, 1),  with  Q(l,8)  =  P//3  and  Q(x,6)  =  Q(x,8), 
for  all  x  €  (—00, 1).  It  follows  that  TU  maps  probability  distributions  over  (  —  00, 1]  to  probability 
distributions  over  (—00, 1),  and  Tq(Sx)  =  Tq(5x),  for  all  x  €  (—00, 1). 

We  next  show  that  the  sequence  {Tqu(Sx)}  is  tight,  in  that  for  any  e  >  0  there  exists  a  compact 
set  Ae  such  that  TU  (5X)(/LJ  >  1  —  e,  for  all  n.  The  expected  value  of  the  distribution  T?(6X) 
is  simply  E-Wivtie*'1))]  with  x  =  k'(v0)  <  1.  Recall  that  E-i^Mfl*-1))]  =  (P / J3)1  k' {v0)  -»  0. 
This  implies  that 

min{0,fc'(^o)}     <    E.!^'^^*-1))] 

I    ?3(<y  (-00,  -A)(-A)  +  (1  -  T?(5x)(-oo,  -A))l 

for  all  A  >  0.  Rearranging, 

isw(-°o.-a)<1-^1°-'> 

which  implies  that  (T^(<5X)},  and  therefore  {TQn(5x)},  is  tight. 

Tightness  implies  that  there  exists  a  subsequence  T^  ^  n(<5x)  that  converges  weakly,  i.e.  in  dis- 
tribution, to  some  distribution  n.  Since  Q(x,6)  is  continuous  in  x,  then  Tq{Tq^^{5x))  converges 
weakly  to  Tq(tt).  But  the  linearity  of  Tq  implies  that 

T^)+1(5x)-Tq(§x) 
TQ(TQMn)(8*))  =  ~ ^y- +TQMn)Vx) 

and  since  0(n)  — >  00  we  must  have  Tq{tv)  =  n. 

Recall  that  Tq  maps  probability  distributions  over  (-00, 1]  to  probability  distributions  over 
(-00, 1).  This  implies  that  tv  =  Tq(tt)  has  no  probability  mass  at  {1}.  Since  Tq  and  Tq  coincide 
for  such  distributions,  it  follows  that  7r  =  Tq(tt),  so  that  n  is  an  invariant  distribution  on  (-00, 1) 
under  Q.    ■ 
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The  argument  for  the  case  with  utility  bounded  below  is  very  similar.  Define  the  transition 
function  Q(x,8)  as  above,  but  for  all  i£E,  since  now  k'(v)  can  take  on  any  real  value.  If  utility 
is  unbounded  above  but  7  <  1  then  there  exists  an  upper  bound  vH  <  00  for  the  ergodic  set  for 
v.  Define  the  utility  level  v0  >  v  by  k'(vo)  =  1.  Next,  define  vl  to  be  the  minimum  of  the  policy 
function  gw  over  v  e  [fo,^],  which  is  defined  since  gw  is  continuous  over  this  compact  set.  If 
utility  is  bounded  above  then  let  v^  by  the  minimum  of  gw  over  v  e  [vq,  v),  which  is  defined  since 
\imv^y  gw(0,  v)  =  v.  In  both  cases,  since  gw  >  v  we  must  have  that  this  minimum  is  above  misery: 
vl  >  v.  Finally,  the  transition  function  is  continuous  with  Q(x,9)  <  k'(vi)  <  00.  The  rest  of  the 
argument  is  then  a  simple  modification  of  the  one  above  for  utility  unbounded  below,  with  k'iyLj 
playing  the  role  of  1  (things  are  actually  slightly  simpler  here,  since  no  continuous  extension  of  Q 
is  required). 

Proof  of  Proposition  3 

Consider  indexing  the  relaxed  planning  problem  by  e  by  setting  A  =  e_1,  with  associated  value  func- 
tion k(v;  e).  We  first  show  that  if  an  initial  distribution  ip  satisfies  the  condition  J  k'(v;  e)  dip(v)  =  0, 
then  the  solution  to  the  relaxed  problem  and  original  problem  coincide.  We  then  show  that  for  any 
initial  distribution  there  exists  a  value  for  e  that  satisfies  this  condition. 

Since  utility  is  unbounded  below,  we  have  k'{vt;  e)  =  Et_i  [l  —  \c!(uvt  (#*))] .  Applying  the  law  of 
iterated  expectations  to  (12)  then  yields 

t 


E_1[l-Ac'«(^)]  =  ^0fc'(U;e 


With  logarithmic  utility  c'(u)  =  c{u),  so  that  J  k'(v;e)dip(v)  =  0  implies  /E_i[q]  dtp  =  A-1  =  e 
for  allt  =  0, 1, . . .  The  allocation  is  incentive  compatible  by  Lemma  2  below,  and  applying  part  (c) 
of  Theorem  2,  it  follows  that  it  must  solve  the  original  planning  problem. 

Now  consider  any  initial  distribution  tp.  We  argue  that  we  can  find  a  value  of  A  =  e_1  such  that 
/  k'(v;  e)  dtp(v)  =  0.  The  homogeneity  of  the  sequential  problem  implies  that 

k(v;  e)  =  — ■ -r  log(e)  +  k  (v  -  — —  log(e);  1 

Note  that  k'(v  —  j^  log(e);  1)  is  strictly  increasing  in  e  and  limits  to  1  as  e  — ►  00,  and  to  —00  as 
e  — >  —  00.  It  follows  that 

J  k'(v]e)d^(v)  =  J  k1  (v  -  ^-log(e);l)  #(u)  =  0 

defines  a  unique  value  of  e*  for  any  initial  distribution  ip.  The  monotonicity  of  e*(ip)  then  follows 
immediately  by  using  the  fact  that  k'(-;l)  is  a  strictly  decreasing  function. 
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Proof  of  Lemma  2 

(a)  If  utility  is  also  bounded  below,  then  the  result  follows  from  part  (b).  So  suppose  utility  is 
unbounded  below,  but  bounded  above.  Then  k'(gw(9,  •))  is  continuous  and  Proposition  1  implies 
that  lim^-oo  k'(gw(9,v))  =  1.  It  follows  that  ma,xv  k' (gw (9 ,  v))  is  attained,  so  there  exists  a  vl  > 
-co  such  that  gw(9,v)  >  vl- 

(b)  If  utility  is  bounded  below,  the  result  follows  immediately  from  part  (c)  of  Theorem  2. 

(c)  Using  the  first-order  conditions  from  the  proof  of  Proposition  1,  one  can  show  that: 

d(u(6))   -  9' 

With  logarithmic  utility  this  implies  that  gu(9,v)  —  gu(9,v)  <  log  (9/9).  The  incentive  constraint 
then  implies  that  gw(9,v)  -  gw(9,v)  <  (9/(3) log(0/0)  =  A.  It  follows  that  v^1'1)  >  v^O1'1)  -  tA 
for  all  pairs  of  histories  9*~l  and  9l~l .  Then 

/^[^(cr'-1^-1))]  ^E-!^*-1)]  -PtA. 

From  part  (b)  of  Theorem  2  we  have  limt_00  /5*  E_1[ut(0'-1)]  =  0.  Since  lim^^  (3ltA  =  0,  it  follows 
that  limsup^^E-iMff'-1^*-1)]  >  0. 

(d)  If  7  >  0  then  the  bound  in  (13)  implies  that  k'  (gw  (9,  v))  >  1  -  (3/(3  and  the  result  follows 
immediately.  If  7  <  1,  then  we  can  define  n  =  1  —  (1  —  /3/$)/(l  —  7),  and  define  vh  by  k'iyn)  =  «• 
Then  for  all  v  <  v #  we  have  gw(9,v)  <  v.  It  follows  that  the  unique  ergodic  set  is  bounded  above 
by  vh-  We  can  now  apply  the  argument  in  (a)  so  there  exists  a,  vl  >  —00  such  that  gw  (9,  v)  >  vl- 

Proof  of  Lemma  3 

Part  (a)  is  immediate  since  by  continuity  of  the  policy  functions,  consumption  is  bounded.  For  part 
(b),  recall  that  J  k'(v)  dip*(v)  =  0  under  the  invariant  distribution  ip*.  If  utility  is  unbounded  below 
then  all  solutions  for  consumption  are  interior.  If  utility  is  bounded  below,  then  corner  solutions 
with  gc(9,v)  =  0  for  some  9  can  only  occur  for  low  enough  levels  of  v,  so  that  gc(9,v)  is  bounded, 
for  all  9  in  this  compact  set.  Recall  that  for  interior  solutions 


l-k'(v)  =  XE[c'{gu(9,v))]=XE[c'(u(c(gu(9,v)))) 

Applying  Jensen's  inequality  we  obtain 

E[c{gu(9,v))]  dP(v)X\  <  JE[c'(u(c(g^9,v)))))  #»  =  1. 
The  result  then  follows  since  c'(u(c))  is  an  increasing  function  of  c. 


c     u 
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